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SIMIAN ADENOVIRUS NUCLEIC ACID AND AMINO ACID SEQUENCES, 
VECTORS CONTAINING SAME, AND METHODS OF USE 

BACKGROUND OF THE INVENTION 

Adenovirus is a double-stranded DNA virus with a genome size of about 36 
kilobases (kb), which has been widely used for gene transfer applications due to its ability 
to achieve highly efficient gene transfer in a variety of target tissues and large transgene 
capacity. Conventionally, El genes of adenovirus are deleted and replaced with a 
transgene cassette consisting of the promoter of choice, cDNA sequence of the gene of 
interest and a poly A signal, resulting in a replication defective recombinant virus. 

Adenoviruses have a characteristic morphology with an icosahedral capsid 
consisting of three major proteins, hexon (II), penton base (III) and a knobbed fibre (IV), 
along with a number of other minor proteins, VI, VIII, IX, Ilia and IVa2 [W.C. Russell, J. 
Gen Virol, 81:2573-2604 (Nov 2000)]. The virus genome is a linear, double-stranded 
DNA with a terminal protein attached covalently to the 5' termini, which have inverted 
terminal repeats (ITRs). The virus DNA is intimately associated with the highly basic 
protein VII and a small peptide termed mu. Another protein, V, is packaged with this 
DNA-protein complex and provides a structural link to the capsid via protein VI. The 
virus also contains a virus-encoded protease, which is necessary for processing of some of 
the structural proteins to produce mature infectious virus. 

Recombinant adenoviruses have been described for delivery of molecules to host 
cells. See, US Patent 6,083,716, which describes the genome of two chimpanzee 
adenoviruses. 

What is needed in the art are more effective vectors which avoid the effect of pre- 
existing immunity to selected adenovirus serotypes in the population and/or which are 
useful for repeat administration and for titer boosting by second vaccination, if required. 

Summary of the Invention 

The present invention provides the isolated nucleic acid sequences and amino acid 
sequences of six simian adenoviruses, vectors containing these sequences, and cell lines 
expressing simian adenovirus genes. Also provided are a number of methods for using 
the vectors and cells of the invention. 
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The methods of the invention involve delivering one or more selected 
heterologous gene(s) to a mammalian patient by administering a vector of the invention. 
Because the various vector constructs are derived from simian rather than from human 
adenoviruses, the immune system of the non-simian human or veterinary patient is not 
5 primed to respond immediately to the vector as a foreign antigen. Use of the 

compositions of this invention thus permits a more stable expression of the selected 
transgene when administered to a non-simian patient. Use of the compositions of this 
invention for vaccination permits presentation of a selected antigen for the elicitation of 
protective immune responses. Without wishing to be bound by theory, the ability of the 

10 adenoviruses of the invention to transduce human dendritic cells is at least partially 
responsible for the ability of the recombinant constructs of the invention to induce an 
immune response. The recombinant simian adenoviruses of this invention may also be 
used for producing heterologous gene products in vitro. Such gene products are 
themselves useful in a variety for a variety of purposes such as are described herein. 

15 These and other embodiments and advantages of the invention are described in 

more detail below. 



Brief Description of the Drawings 

Fig. 1 provides an alignment of the amino acid sequences of the LI and a portion 

20 of the L2 loops of the capsid protein hexon of the chimpanzee adenovirus CI [SEQ ID 
NO:13], chimpanzee adenovirus C68 (Pan-9) [SEQ IDNO:14], and the novel Pan5 [SEQ 
ID NO:15], Pan6 [SEQ ID NO: 16] and Pan7 [SEQ ID NO: 17] chimpanzee adenovirus 
sequences of the invention. The intervening conserved region is part of the pedestal 
domain conserved between adenovirus serotypes. 

25 Fig. 2 provides an alignment of the amino acid sequences of the fiber knob 

domains of chimpanzee C68 (Pan-9) [SEQ ID NO: 18], Pan-6 [SEQ ID NO: 19], Pan-7 
[SEQ ID NO:20], and Pan-5 [SEQ ID NO:21] and the human adenoviruses serotypes 2 
[SEQ ID NO:22] and 5 [SEQ ID NO:23]. 

30 DETAILED DESCRIPTION OF THE INVENTION 

The invention provides novel nucleic acid and amino acid sequences from Ad 
Pan5 [SEQ ID NO:l-4, 15 and 21], Ad Pan6 [SEQ ID NO: 5-8, 16, 19], and Ad serotype 

2 
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Pan7 [SEQ ID NO: 9-12, 17, 20], which were originally isolated from chimpanzee lymph 
nodes. In several instances throughout the specification, these adenoviruses are 
alternatively termed herein C5, C6 and C7, respectively. Also provided are sequences 
from adenovirus SV1 [SEQ ID NO: 24-28], which was originally isolated from the 
5 kidney cells of cynomolgus monkey. The invention also provides sequences of 

adenoviruses SV-25 [SEQ ID NO:29-33] and SV-39 [SEQ ID NO: 34-37], which were 
originally isolated from rhesus monkey kidney cells. 

The present invention provides novel adenovirus vectors and packaging cell lines 
to produce those vectors for use in the in vitro production of recombinant proteins or 

10 fragments or other reagents. The invention further provides compositions for use in 

delivering a heterologous molecule for therapeutic or vaccine purposes. Such therapeutic 
or vaccine compositions contain the adenoviral vectors carrying an inserted heterologous 
molecule. In addition, novel sequences of the invention are useful in providing the 
essential helper functions required for production of recombinant adeno-associated viral 

15 (AAV) vectors. Thus, the invention provides helper constructs, methods and cell lines 
which use these sequences in such production methods. 

The term "substantial homology' 5 or "substantial similarity," when referring to a 
nucleic acid or fragment thereof, indicates that, when optimally aligned with appropriate 
nucleotide insertions or deletions with another nucleic acid (or its complementary strand), 

20 there is nucleotide sequence identity in at least about 95 to 99% of the aligned sequences. 

The term "substantial homology" or "substantial similarity," when referring to 
amino acids or fragments thereof, indicates that, when optimally aligned with appropriate 
amino acid insertions or deletions with another amino acid (or its complementary strand), 
there is amino acid sequence identity in at least about 95 to 99% of the aligned sequences. 

25 Preferably, the homology is over full-length sequence, or a protein thereof, or a fragment 
thereof which is at least 8 amino acids, or more desirably, at least 15 amino acids in 
length. Examples of suitable fragments are described herein. 

The term "percent sequence identity" or "identical" in the context of nucleic acid 
sequences refers to the residues in the two sequences that are the same when aligned for 

30 maximum correspondence. The length of sequence identity comparison may be over the 
full-length of the genome (e.g., about 36 kbp), the full-length of an open reading frame of 
a gene, protein, subunit, or enzyme [see, e.g., the tables providing the adenoviral coding 
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regions], or a fragment of at least about 500 to 5000 nucleotides, is desired. However, 
identity among smaller fragments, e.g. of at least about nine nucleotides, usually at least 
about 20 to 24 nucleotides, at least about 28 to 32 nucleotides, at least about 36 or more 
nucleotides, may also be desired. Similarly, "percent sequence identity" may be readily 
5 determined for amino acid sequences, over the full-length of a protein, or a fragment 
thereof. Suitably, a fragment is at least about 8 amino acids in length, and may be up to 
about 700 amino acids. Examples of suitable fragments are described herein. 

Identity is readily determined using such algorithms and computer programs as 
are defined herein at default settings. Preferably, such identity is over the full length of 

10 the protein, enzyme, subunit, or over a fragment of at least about 8 amino acids in length. 
However, identity may be based upon shorter regions, where suited to the use to which 
the identical gene product is being put. 

As described herein, alignments are performed using any of a variety of publicly 
or commercially available Multiple Sequence Alignment Programs, such as "Clustal W", 

15 accessible through Web Servers on the internet. Alternatively, Vector NTI utilities are 
also used. There are also a number of algorithms known in the art that can be used to 
measure nucleotide sequence identity, including those contained in the programs 
described above. As another example, polynucleotide sequences can be compared using 
Fasta, a program in GCG Version 6.1 . Fasta provides alignments and percent sequence 

20 identity of the regions of the best overlap between the query and search sequences. For 
instance, percent sequence identity between nucleic acid sequences can be determined 
using Fasta with its default parameters (a word size of 6 and the NOP AM factor for the 
scoring matrix) as provided in GCG Version 6.1, herein incorporated by reference. 
Similarly programs are available for performing amino acid alignments. Generally, these 

25 programs are used at default settings, although one of skill in the art can alter these 

settings as needed. Alternatively, one of skill in the art can utilize another algorithm or 
computer program that provides at least the level of identity or alignment as that provided 
by the referenced algorithms and programs. 

As used throughout this specification and the claims, the term "comprise" and its 

30 variants including, "comprises", "comprising", among other variants, is inclusive of other 
components, elements, integers, steps and the like. The term "consists of or "consisting 
of are exclusive of other components, elements, integers, steps and the like. 
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I. The Simian Adenovirus Sequences 

The invention provides nucleic acid sequences and amino acid sequences of 
Pan5, Pan6, Pan7, SV1, SV25 and SV39, which are isolated from the other viral material 
5 with which they are associated in nature. 

A. Nucleic Acid Sequences 

The Pan5 nucleic acid sequences of the invention include nucleotides 1 to 
36462 of SEQ ID NO:l. The Pan6 nucleic acid sequences of the invention include 
nucleotides 1 to 36604 of SEQ ID NO: 5. The Pan7 nucleic acid sequences of the 
10 invention include nucleotides 1 to 36535 of SEQ ID NO: 9. The SV1 nucleic acid 

sequences of the invention include nucleotides 1 to 34264 of SEQ ID NO: 24. The SV25 
nucleic acid sequences of the invention include nucleotides 1 to 31044 of SEQ ID NO: 
29. The SV39 nucleic acid sequences of the invention include nucleotides 1 to 341 15 of 
SEQ ID NO: 34. See, Sequence Listing, which is incorporated by reference herein. 
1 5 The nucleic acid sequences of the invention further encompass the 

strand which is complementary to the sequences of SEQ ID NO: 5, 9, 24, 29 and 34, 
as well as the RNA and cDNA sequences corresponding to the sequences of these 
sequences figures and their complementary strands. Further included in this invention 
are nucleic acid sequences which are greater than 95 to 98%, and more preferably 
20 about 99 to 99.9% homologous or identical to the Sequence Listing. Also included in 
the nucleic acid sequences of the invention are natural variants and engineered 
modifications of the sequences provided in SEQ ID NO: 5, 9, 24, 29 and 34 and their 
complementary strands. Such modifications include, for example, labels that are 
known in the art, methylation, and substitution of one or more of the naturally 
2 5 occurring nucleotides with a degenerate nucleotide. 

The invention further encompasses fragments of the sequences of 
Pan5, Pan6, Pan7, SV1, SV25 and SV39, their complementary strand, cDNA and 
RNA complementary thereto. Suitable fragments are at least 1 5 nucleotides in length, 
and encompass functional fragments, i.e., fragments which are of biological interest. 
30 For example, a functional fragment can express a desired adenoviral product or may 
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be useful in production of recombinant viral vectors. Such fragments include the gene 
sequences and fragments listed in the tables below. 

The following tables provide the transcript regions and open reading frames 
in the simian adenovirus sequences of the invention. For certain genes, the transcripts 
5 and open reading frames (ORPs) are located on the strand complementary to that 
presented in SEQ ID NO: 5, 9, 24, 29 and 34. See, e.g., E2b, E4 and E2a. The 
calculated molecular weights of the encoded proteins are also shown. Note that the 
Ela open reading frame Pan5 [nt 576-1436 of SEQ ID NO: 1], Pan6 [nt576to 1437 of 
SEQ ID NO: 5] and Pan7 [nt 576 to 1437 of SEQ ID NO: 9] contain internal splice 
10 sites. These splice sites are noted in the following tables. 
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AdPan-5 [SEQ IDNO:l] 



Regions 


Start 


End 


MW 

lvl. YY . 






(nt) 


(nt) 


(uaitonsj 


ITR 


1 


120 




Ela 


Transcript 


478 








13S 


576-664, 


1233- 
1436 


28120 




12S 


576-1046, 1233- 


24389 






1436 








AO 

9S 


576-644,1233- 










ItJO 






Transcript 




1 51fi 

l J 1 D 




Elb 


Transcript 


1 J J Z- 








Small T 


1 ^QO 


? 1 71 






Large T 


1904 








IX 




3Q9fl 


1 4427 




Transcript 




J7J7 




E2b 


Transcript • 










PTP 


10349 


OtJ 1 


72930 




Polymerase 


8448 


5083 


I27237 




IVa2 


5604 


3980 


50466 

*s \J ~ \J\J 




Transcript 




3960 




28.1 kD 


5155 


5979 


281 4 J 


Agnoprotein 


7864 


8580 


25755 


LI 


Transcript 


10849 








52/55D 


10851 


12025 






ma 


12050 


13819 


6 5 669 




Transcript 




13832 






Transcript 


13894 






L2 


Penton 


13898 


15490 


59292 




VII 


15494 


16078 


21478 




V 


16123 


17166 


39568 




Mu 


17189 


17422 


8524 




transcript 




17442 






Transcript 


17488 






L3 


VI j 


17491 


18222 


26192 




Hexon 


18315 


21116 


104874 




Endoprotease 


20989 


21783 


28304 




transcript 




21811 
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Ad Pan-5 (cont'd) [SEQ ID NO:l] 


Regions 


Start 










(nt) 

V 1 * 1 -/ 


(tit) 


(Dalton c) 

1 J—' l/» H v / to / 


E2a 


Transcript 


26782 








DBP 


23386 


21845 


573JS 




transcript 




21788 




T A 


Transcript 


234U6 




- 




1UUKJJ 


23412 


Of OAC 

25805 


88223 




ku nomolog 


23D25 


26356 


24538 




\7TTf 
VJJ1 


26425 


1 1 

271 1 l 


24768 




transcript 




27421 




E3 


Transcript 


26788 




- 




r\^r lit 

Ori#l 


27H2 


27432 


12098 




Urt #2 


27386 


28012 


23040 




Ori #i 


27994 


28527 


19525 




Orf #4 


28557 


29156 


22567 




Orf#5 


2Q1 6Q 


9Q7R3 


/,/.zo / 




Orf #6 


29798 


30673 


1 USA 




Orf#7 


30681 


30956 


10 All 




Orf #8 


30969 


313Q6 


1 OJZJ 




Orf #9 


31389 


31796 

Jl / 7U 


l JZJO 




transcrint 




3 1 837 




L5 


Trail corint 










Pihpr 




jjj /Z 


4/O/U 




transcript 








E4 


Transcript 


36135 








Orf 7 


33710 


33462 


9191 




Orf 6 


34615 


33710 


35005 




Orf 4 


34886 


34521 


13878 




Orf 3 


35249 


34896 


13641 




Orf 2 


35635 


35246 


14584 




Orf 1 


36050 


35676 


13772 




Transcript 




33437 




ITR 




36343 


36462 
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AdPan-6 [SEQIDNO:5] 



Regions 


Start (nt) 


End 
(nt) 


M.W. 
(Daltons) 


ITR 


1 


123 


- 


Ela 


transcript 


478 




- 




13S 


576-1143, 1229- 
1437 


j 28291 




inn 

12c> 


576-1050, 1229- 
1437 


24634 




no 


576-645, 1229- 
1437 


10102 




ucinbcnpi 




1516 


- 


Flh 


tl*Q n con v>f- 

uanbcripL 


1553 




- 




omau 1 


1600 


2172 


22315 




Large 1 


1905 


3413 


55594 






3498 


3926 


14427 




transcript 




3965 


- 


E2b 


transcript 


10341 




- 




PTP 


10340 


8451 


72570 




Polymerase 


8445 


5089 






IVa2 


5610 


3986 






transcript 




3966 




LI 


transcript 


10838 








52/55 kD 


10840 


12012 


44205 




Ilia 


12036 


13799 


65460 




Transcript 




13812 


- 


28.1 kd 


5161 


5985 


28012 


Agnop 


rotein 


7870 


8580 


25382 


L2 


transcript j 


13874 




- 




Penton 


13878 


15467 


59314 




VII 


15471 


16055 


21508 




V 


16100 


17137 


39388 




Mu 


17160 


17393 


8506 




transcript 




17415 




L3 


transcript 


17466 








VI 


17469 


18188 


25860 




Hexon 


18284 


21112 


106132 




Endoprotease 


21134 


21754 


23445 




transcript 




21803 




E2a 


transcript 


26780 








DBP 1 


23375 


21837 


57299 




transcript 




21780 
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Ad Pan-6 (cont'd) [SEQ ID NO:5] 



Regions 


Start (nt)' 


End 
(nt) 


M.W. 
(Daltons) 


L4 


Transcript 


23398 






1UUKD 


23404 


25806 


88577 


33 kD homolog 


25523 


26357 


24609 


vm 


26426 


27109 


24749 


transcript 




27419 


- 


E3 


transcript 


26786 




- 


Orf#l 


27110 


27430 


12098 


Orf#2 


27384 


28007 


22880 


Orf#3 


27989 


28519 


L. 19460 


Orf#4 


28553 


29236 


25403 


Orf#5 


29249 


29860 


22350 


Orf#6 


29875 


30741 


31028 


Orf#7 


30749 


31024 


10469 


Orf#8 


31030 


31464 


16540 


Orf#9 


31457 


31864 


'15264 


transcript 




31907 


- 


L5 


transcript 


32159 






Fiber 


32162 


33493 


47364 


transcript 




33574 


- 


E4 


transcript 


36276 




- 


Orf7 


33841 


33593 


9177 


Orf6 


34746 


33841 


35094 


Orf4 


35017 


34652 


13937 


Orf3 


35380 


35027 


13627 


Orf2 


35766 


35377 


14727 


Orf 1 


36181 


35807 


13739 


transcript 




33558 




ITR 




36482 


36604 





5 
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AdPan-7 [SEQIDNO:9] 



Regions 


Start (nt) 


End (nt) 


M.W. (Daltons) 


ITR 




I 


132 




Ela 


transcript 


478 






13S 


576-H43, 1229-1437 


28218 


12S 


576-1050, 1229- 1437 


24561 


9S 


576-645, 
1229-1437 


10102 


transcript 




1516 




Elb 


transcript 


1553 






Small T 


1600 


2178 




LargeT 


1905 


3419 


jjoyo 


IVa2 


3992 


5616 


JXJZl u 


transcript 




3971 




E2b 


transcript 


10341 






PTP 


10340 


8457 


72297 


Polymerase 


8451 


5095 


126994 


IX 


3504 


3932 


14441 


transcript 




3972 




28.1kD 


5167 


5991 


28028 


Agnoprotein 


7876 


8586 


25424 


LI 


transcript 


10834 






52/55 kD 


10836 


I20ll 


44302 


Ilia 


12035 


13795 


65339 


transcript 




13808 


- 


L2 


transcript 


13870 




- 


Penton 


13874 


15469 


59494 


VII 


15473 


16057 


21339 


V 


16102 


17139 


39414 


Mu 


17167 


17400 


8506 


transcript 




17420 


- 


L3 


transcript 


17467 






VI 


17470 


18198 


26105 


Hexon 


18288 


21086 


104763 


Endoprotease 


21106 


21 732 


23620 


transcript 




21781 




E2a 


transcript 


26764 






DBP 


23353 


21815 


57199 


transcript 




21755 
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Ad Pan-7 (cont'd) [SEQ ID NO: 9] 


Regions 


Start (nt) 


End (nt) 


\ X TXT /T\ _ 7 j. _ \ 

MM. (Daltons) 


L4 


transcript 


23370 




- 




lOOkD 


23376 


25781 


88520 




33 kD 
homolog 


25489 


26338 


25155 




vrn 


26410 


27093 


24749 




transcript 




27403 


- 


E3 


transcript 


26770 




- 




Orf#l 


27094 


27414 


12056 




Orf#2 


27368 


27988 


22667 




Orf#3 


27970 


28500 


19462 




Orf#4 


28530 


29150 


22999 




Orf#5 


29163 


29777 


22224 




Orf#6 


29792 


30679 


32153 




Orf#7 


30687 


30962 


10511 




Orf#8 


30968 


31399 


16388 




Orf#9 


31392 


31799 


15205 




transcript 




31842 


- 


L5 


transcript 


32091 




- 




Fiber 


32094 


33425 


47344 




transcript 




33517 


- 


E4 


transcript 


36208 




- 




Orf7 


33784 


33536 


9191 




Orf6 


34689 


33784 


35063 




Orf4 


34960 


34595 


13879 




Orf3 


35323 


34970 


13641 




Orf2 


35709 


35320 


14644 




Orf 1 


36123 


35749 


13746 




transcript 




33501 




ITR 




36404 


36535 





5 
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Ad SV-1 
[SEQIDNO:24] 


Ad SV-25 
[SEQIDNO.29] 


Ad SV-39 
[SEQ ID NO: 34] 


Region 


Start 


End 


Start 


End 


Start 


End 


ITR 


1 


106 


1 


133 


1 


150 


Ela 


352 


1120 


- 


- 


404 


1409 


Elb 


1301 


2891 


359 


2273 


1518 


3877 


E2b 


9257 


2882 


9087 


2754 


10143 


3868 


E2a 


24415 


20281 


24034 


20086 


25381 


21228 


E3 


24974 


27886 


24791 


25792 


25790 


29335 


E4 


33498 


30881 


30696 


28163 


33896 


31157 


ITR 


34145 


34264 


30912 


31044 


33966 


34115 






Ad SV-1 
[SEQ IDNO:24] 


Ad SV-25 
[SEQ IDNO:29] 


Ad SV-39 
[SEQ ID NO: 34] 


Region 


Start 


End 


Start 


End 


Start 


End 


ITR 


1 


106 


1 


133 


1 


150 


LI 


9513 


12376 


9343 


12206 


10416 


13383 


L2 


12453 


15858 


12283 


15696 


13444 


16877 


L3 


15910 


20270 


15748 


20080 


17783 


21192 


L4 


21715 


25603 


21526 


25420 


22659 


26427 


L5 


28059 


30899 


25320 


28172 


29513 


31170 


ITR 


34145 


34264 


30912 


31044 


33966 


34115 
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protein 


Ad SV-l,SEQIDNO:24 






Start 


End 


M.W. 


ITR 




1 


106 


- 


Ela 


13S 


459 


953 


18039 


12S 








Elb 


Small T 








LargeT 


1301 


2413 


42293 


DC 


2391 


2885 


16882 


E2b 


IVa2 


4354 


2924 


54087 


Polymerase 


6750 


4027 


102883 


PTP 


9257 


7371 


72413 




Agno-protein 


6850 


7455 


20984 


LI 


52/55 kD 


9515 


10642 


42675 


Ilia 


10663 


12372 


636568 


L2 


Penton 


12454 


13965 


56725 


VII 


13968 


14531 


20397 


V 


14588 


15625 


39374 


Mu 


15645 


15857 


7568 


L3 


VI 


15911 


16753 


30418 


Hexon 


16841 


19636 


104494 


Endoprotease 


19645 


20262 


23407 


2a 


DBP 


21700 


20312 


52107 


L4 


lOOkD 


21721 


24009 


85508 


VIII 


24591 


25292 


25390 
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protein 


Ad SV-1 (cont'd) 
SEQIDNO: 24 






Start 


End 


MM. 


E3 


Orf#l 


25292 


25609 


11950 




Orf#2 


25563 


26081 


18940 




Orf#3 


26084 


26893 


30452 




Orf#4 


26908 


27180 


10232 




Orf#5 


27177 


17512 


12640 




Orf#6 


27505 


27873 


13639 


L5 


Fiber #2 


28059 


29150 


39472 




Fiber #1 


29183 


30867 


61128 


E4 


Orf7 


31098 


30892 


7837 




Orf6 


31982 


31122 


33921 




Orf4 


32277 


31915 


14338 




Orf3 


32629 


32279 


13386 




Orf2 


33018 


32626 


14753 




Orf 1 


33423 


33043 


14301 


ITR 




34145 


34264 
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protein 


Ad SV-25, SEQIDNO:29 


Ad SV-39, SEQIDNO:34 






Start 


End 


M.W. 


Start 


End 


M.W. 


ITR 




1 


133 


- 


1 


150 


- 


Ela 


13S 








492 


1355 


28585 


12S 








492 


1355 


25003 


Elb 


Small T 


478 


1030 


20274 


1518 


2075 


21652 


Large T 


829 


2244 


52310 


1823 


3349 


55534 


DC 


2306 


2716 


13854 


3434 


3844 


14075 


E2b 


IVa2 


4208 


2755 


54675 


3912 


5141 


46164 


Poly- 
merase 


6581 


3858 


102839 


7753 


5033 


103988 


FTP 


9087 


7207 


71326 


10143 


8335 


69274 




Agno- 
protein 


6681 


7139 


16025 


- 


- 


- 


LI 


52/55 kD 


9345 


10472 


42703 


10418 


11608 


44232 


Ilia 


10493 


12202 


63598 


11574 


13364 


66078 


L2 


Penton 


12284 


13801 


56949 


13448 


14959 


56292 


vn 


13806 


14369 


20369 


14960 


15517 


20374 


V 


14426 


15463 


39289 


15567 


16628 


39676 


Mu 


15483 


15695 


7598 


16650 


16871 


7497 


L3 


VI 


15749 


16591 


30347 


16925 


17695 


28043 


Hexon 


16681 


19446 


104035 


17785 


20538 


102579 


Endo- 
protease 


19455 


20072 ! 


23338 


20573 


21181 


22716 


2a 


DBP 


21511 


20123 1 


52189 


22631 


21231 


53160 


L4 


lOOkD 


21532 


23829 


85970 


22659 


25355 


100362 


VIII 


24408 


25109 


25347 


25410 


26108 


25229 
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protein 


Ad SV-25, SEQIDNO:29 
(cont'd) 


Ad SV-39, SEQ ID NO:34, 
(cont'd) 






Start 


End 


M.W. 


Start 


End 


M.W. 


E3 


Orf#l 


25109 


25426 


11890 


26375 


27484 


42257 


Orf#2 








27580 


28357 


29785 


Orf#3 








28370 


28645 


10514 


Orf#4 








28863 


29333 


18835 


Orf#5 














Orf#6 














L5 


Fiber #2 


25380 


26423 


37529 










Fiber #1 


26457 


28136 


60707 


29515 


31116 


56382 


E4 


Orf7 








31441 


31118 


11856 


Orf6 


29255 


28395 


33905 


32292 


31438 


33437 


Orf4 


29550 


29188 


14399 


32587 


32222 


13997 


Orf3 


29902 


29552 


13284 


32954 


32607 


13353 


Orf2 


30291 


29899 


14853 


33348 


32959 


14821 


Orf 1 


30316 


30696 


14301 


33764 


33378 


14235 


ITR 




30912 


31044 




33966 


34115 





The Pan5, Pan6, Pan7, SV1, SV25 and SV39 adenoviral nucleic acid 
5 sequences are useful as therapeutic agents and in construction of a variety of vector 
systems and host cells. As used herein, a vector includes any suitable nucleic acid 
molecule including, naked DNA, a plasmid, a virus, a cosmid, or an episome. These 
sequences and products may be used alone or in combination with other adenoviral 
sequences or fragments, or in combination with elements from other adenoviral or non- 
10 adenoviral sequences. The adenoviral sequences of the invention are also useful as 

antisense delivery vectors, gene therapy vectors, or vaccine vectors. Thus, the invention 
further provides nucleic acid molecules, gene delivery vectors, and host cells which 
contain the Ad sequences of the invention. 
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For example, the invention encompasses a nucleic acid molecule 
containing simian Ad ITR sequences of the invention. In another example, the invention 
provides a nucleic acid molecule containing simian Ad sequences of the invention 
encoding a desired Ad gene product. Still other nucleic acid molecule constructed using 
5 the sequences of the invention will be readily apparent to one of skill in the art, in view of 
the information provided herein. 

In one embodiment, the simian Ad gene regions identified herein may be 
used in a variety of vectors for delivery of a heterologous molecule to a cell. For 
example, vectors are generated for expression of an adenoviral capsid protein (or 
10 fragment thereof) for purposes of generating a viral vector in a packaging host cell. Such 
vectors may be designed for expression in trans. Alternatively, such vectors are designed 
to provide cells which stably contain sequences which express desired adenoviral 
functions, e.g., one or more of El a, El b, the terminal repeat sequences, E2a, E2b, E4, 
E40RF6 region. 

15 In addition, the adenoviral gene sequences and fragments thereof are 

useful for providing the helper functions necessary for production of helper-dependent 
viruses (e.g., adenoviral vectors deleted of essential functions or adeno-associated viruses 
(AAV)). For such production methods, the simian adenoviral sequences of the invention 
are utilized in such a method in a manner similar to those described for the human Ad. 

20 However, due to the differences in sequences between the simian adenoviral sequences of 
the invention and those of human Ad, the use of the sequences of the invention essentially 
eliminate the possibility of homologous recombination with helper functions in a host cell 
carrying human Ad El functions, e.g., 293 cells, which may produce infectious 
adenoviral contaminants during rAAV production. 

25 Methods of producing rAAV using adenoviral helper functions have been 

described at length in the literature with human adenoviral serotypes. See, e.g., US Patent 
6,258,595 and the references cited therein. See, also, US Patent 5,871,982; WO 
99/14354; WO 99/15685; WO 99/47691. These methods may also be used in production 
of non-human serotype AAV, including non-human primate AAV serotypes. The simian 

30 adenoviral gene sequences of the invention which provide the necessary helper functions 
(e.g., Ela, Elb, E2a and/or E4 ORF6) can be particularly useful in providing the 
necessary adenoviral function while minimizing or eliminating the possibility of 
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recombination with any other adenoviruses present in the rAAV-packaging cell which are 
typically of human origin. Thus, selected genes or open reading frames of the adenoviral 
sequences of the invention may be utilized in these rAAV production methods. 

Alternatively, recombinant adenoviral simian vectors of the invention may 
5 be utilized in these methods. Such recombinant adenoviral simian vectors may include, 
e.g., a hybrid chimp Ad/AAV in which chimp Ad sequences flank a rAAV expression 
cassette composed of, e.g., AAV 3' and/or 5' ITRs and a transgene under the control of 
regulatory sequences which control its expression. One of skill in the art will recognize 
that still other simian adenoviral vectors and/or gene sequences of the invention will be 
useful for production of rAAV and other viruses dependent upon adenoviral helper. 

In still another embodiment, nucleic acid molecules are designed for 
delivery and expression of selected adenoviral gene products in a host cell to achieve a 
desired physiologic effect. For example, a nucleic acid molecule containing sequences 
encoding an adenovirus El a protein of the invention may be delivered to a subject for use 
as a cancer therapeutic. Optionally, such a molecule is formulated in a lipid-based earner 
and preferentially targets cancer cells. Such a formulation may be combined with other 
cancer therapeutics (e.g., cisplatin, taxol, or the like). Still other uses for the adenoviral 
sequences provided herein will be readily apparent to one of skill in the art. 

In addition, one of skill in the art will readily understand that the Ad 
sequences of the invention can be readily adapted for use for a variety of viral and non- 
viral vector systems for in vitro, ex vivo or in vivo delivery of therapeutic and 
immunogenic molecules. For example, the Pan5, Pan6, Pan7, SV1, SV25 and/or SV39 
simian Ad genomes of the invention can be utilized in a variety of rAd and non-rAd 
vector systems. Such vectors systems may include, e.g., plasmids, lentiviruses, 
retroviruses, poxviruses, vaccinia viruses, and adeno-associated viral systems, among 
others. Selection of these vector systems is not a limitation of the present invention! 

The invention further provides molecules useful for production of the 
simian and simian -derived proteins of the invention. Such molecules which carry 
polynucleotides including the simian Ad DNA sequences of the invention can be in the 
form of naked DNA, a plasmid, a virus or any other genetic element. 
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B. Simian Adenoviral Proteins of the Invention 

The invention further provides gene products of the above adenoviruses, 
such as proteins, enzymes, and fragments thereof, which are encoded by the adenoviral 
nucleic acids of the invention. The invention further encompasses Pan5, Pan6 and Pan7, 
5 SV1, SV25 and SV39 proteins, enzymes, and fragments thereof, having the amino acid 
sequences encoded by these nucleic acid sequences which are generated by other 
methods. Such proteins include those encoded by the open reading frames identified in 
the tables above, in Figs. 1 and 2, and fragments thereof 

Thus, in one aspect, the invention provides unique simian adenoviral 
10 proteins which are substantially pure, i.e., are free of other viral and proteinaceous 

proteins. Preferably, these proteins are at least 10% homogeneous, more preferably 60% 
homogeneous, and most preferably 95% homogeneous. 

In one embodiment, the invention provides unique simian-derived capsid 
proteins. As used herein, a simian-derived capsid protein includes any adenoviral capsid 
15 protein that contains a Pan5, Pan6, Pan7, SV1, SV25 or SV39 capsid protein or a 
fragment thereof, as defined above, including, without limitation, chimeric capsid 
proteins, fusion proteins, artificial capsid proteins, synthetic capsid proteins, and 
recombinantly capsid proteins, without limitation to means of generating these proteins. 

Suitably, these simian-derived capsid proteins contain one or more Pan5, 
20 Pan6, Pan7, SV1, SV25 or SV39 regions or fragments thereof (e.g., a hexon, penton, fiber 
or fragment thereof) in combination with capsid regions or fragments thereof of different 
adenoviral serotypes, or modified simian capsid proteins or fragments, as described 
herein. A "modification of a capsid protein associated with altered tropism" as used 
herein includes an altered capsid protein, i.e, a penton, hexon or fiber protein region, or 
25 fragment thereof, such as the knob domain of the fiber region, or a polynucleotide 
encoding same, such that specificity is altered. The simian-derived capsid may be 
constructed with one or more of the simian Ad of the invention or another Ad serotypes 
which may be of human or non-human origin. Such Ad may be obtained from a variety of 
sources including the ATCC, commercial and academic sources, or the sequences of the 
30 Ad may be obtained from GenBank or other suitable sources. 

The amino acid sequences of the simian adenoviruses penton proteins of 
the invention are provided herein. The AdPanS penton protein is provided in SEQ ED 
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NO:2. The AdPan7 penton is provided in SEQ ID NO:6. The AdPan6 penton is 
provided in SEQ ID NO:10. The SV1 penton is provided in SEQ ID NO:25. The SV25 
penton protein is provided in SEQ ID NO:30. The SV39 penton is provided in SEQ ID 
NO:35. Suitably, any of these penton proteins, or unique fragments thereof, may be 
5 utilized for a variety of purposes. Examples of suitable fragments include the penton 
having N-terminal and/or C-terminal truncations of about 50, 100, 150, or 200 amino 
acids, based upon the amino acid numbering provided above and in SEQ ID NO:2; SEQ 
ID NO:6; SEQ ID NO:25; SEQ ID NO:30, or SEQ ID NO:35. Other suitable fragments 
include shorter internal, C-terminal, or N-terminal fragments. Further, the penton protein 
10 may be modified for a variety of purposes known to those of skill in the art. 

The invention further provides the amino acid sequences of the hexon 
protein of Pan5 [SEQ ID NO:3], Pan6 [SEQ ID NO:7], Pan 7 [SEQ ID NO:l 1], SV1 
[SEQ ID NO:26], SV25 [SEQ ID NO:3 1], and/or SV39 [SEQ ID NO:36]. Suitably, this 
hexon protein, or unique fragments thereof, may be utilized for a variety of purposes. 
15 Examples of suitable fragments include the hexon having N-terminal and/or C-terminal 
truncations of about 50, 100, 150, 200, 300, 400, or 500 amino acids, based upon the 
amino acid numbering provided above and in SEQ ID NO: 3, 7, 1 1, 26, 31 and 36. Other 
suitable fragments include shorter internal, C-terminal, or N-terminal fragments. For 
example, one suitable fragment the loop region (domain) of the hexon protein, designated 
20 DEI and FG 1 , or a hypervariable region thereof. Such fragments include the regions 
spanning amino acid residues about 125 to 443; about 138 to 441, or smaller fragments, 
such as those spanning about residue 1 38 to residue 1 63; about 170 to about 1 76; about 
195 to about 203; about 233 to about 246; about 253 to about 264; about 287 to about 
297; and about 404 to about 430 of the simian hexon proteins, with reference to SEQ ID 
25 NO: 3, 7, 1 1, 26, 3 1 or 36. Other suitable fragments may be readily identified by one of 
skill in the art. Further, the hexon protein may be modified for a variety of purposes 
known to those of skill in the art. Because the hexon protein is the determinant for 
serotype of an adenovirus, such artificial hexon proteins would result in adenoviruses 
having artificial serotypes. Other artificial capsid proteins can also be constructed using 
the chimp Ad penton sequences and/or fiber sequences of the invention and/or fragments 
thereof. 
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In one example, it may be desirable to generate an adenovirus having an 
altered hexon protein utilizing the sequences of a hexon protein of the invention. One 
suitable method for altering hexon proteins is described in US Patent 5,922,315, which is 
incorporated by reference. In this method, at least one loop region of the adenovirus 
5 hexon is changed with at least one loop region of another adenovirus serotype. Thus, at 
least one loop region of such an altered adenovirus hexon protein is a simian Ad hexon 
loop region of the invention (e.g. Pan7). In one embodiment, a loop region of the Pan7 
hexon protein is replaced by a loop region from another adenovirus serotype. In another 
embodiment, the loop region of the Pan7 hexon is used to replace a loop region from 

10 another adenovirus serotype. Suitable adenovirus serotypes may be readily selected from 
among human and non-human serotypes, as described herein. Pan7 is selected for 
purposes of illustration only; the other simian Ad hexon proteins of the invention may be 
similarly altered, or used to alter another Ad hexon. The selection of a suitable serotype 
is not a limitation of the present invention. Still other uses for the hexon protein 

15 sequences of the invention will be readily apparent to those of skill in the art. 

The invention further encompasses the fiber proteins of the simian 
adenoviruses of the invention. The fiber protein of AdPan 5 has the amino acid sequence 
of SEQ ID NO:4. The fiber protein AdPan6 has the amino acid sequence of SEQ ID NO: 
8. The fiber protein of AdPan7 has the amino acid sequence of SEQ ID NO: 12. SV-1 

20 has two fiber proteins; fiber 2 has the amino acid sequence of SEQ ID NO:27 and fiber 1 
has the amino acid sequence of SEQ ID NO:28. SV-25 also has two fiber proteins; fiber 
2 has the amino acid sequence of SEQ ID NO:32 and fiber 1 has the amino acid sequence 
of SEQ ID NO:33. The fiber protein of S V-39 has the amino acid sequence of SEQ ID 
NO:37. 

25 Suitably, this fiber protein, or unique fragments thereof, may be utilized 

for a variety of purposes. One suitable fragment is the fiber knob, which spans about 
amino acids 247 to 425 of SEQ ID NO: 4, 8, 12, 28, 32, 33 and 37. See Fig. 2. Examples 
of other suitable fragments include the fiber having N-terminal and/or C-terminal 
truncations of about 50, 100, 150, or 200 amino acids, based upon the amino acid 

30 numbering provided above and in SEQ ID NO: 4, 8, 12, 28, 32, 33 and 37. Still other 
suitable fragments include internal fragments. Further, the fiber protein may be modified 
using a variety of techniques known to those of skill in the art. 
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The invention further encompasses unique fragments of the proteins of the 
invention which are at least 8 amino acids in length. However, fragments of other desired 
lengths can be readily utilized. In addition, the invention encompasses such 
modifications as may be introduced to enhance yield and/or expression of a Pan5, Pan6, 
5 Pan7, SV1, SV25 or SV39 gene product, e.g., construction of a fusion molecule in which 
all or a fragment of the Pan5, Pan6, Pan7, SV1 , SV25 or SV39 gene product is fused 
(either directly or via a linker) with a fusion partner to enhance. Other suitable 
modifications include, without limitation, truncation of a coding region (e.g., a protein or 
enzyme) to eliminate a pre- or pro-protein ordinarily cleaved and to provide the mature 

10 protein or enzyme and/or mutation of a cod ing region to provide a secretable gene 

product. Still other modifications will be readily apparent to one of skill in the art. The 
invention further encompasses proteins having at least about 95% to 99% identity to the 
Pan5, Pan6, Pan7, SV1, SV25 or SV39 proteins provided herein. 

As described herein, vectors of the invention containing the adenoviral 

15 capsid proteins of the invention are particularly well suited for use in applications in 
which the neutralizing antibodies diminish the effectiveness of other Ad serotype based 
vectors, as well as other viral vectors. The rAd vectors of the invention are particularly 
advantageous in readministration for repeat gene therapy or for boosting immune 
response (vaccine titers). 

20 Under certain circumstances, it may be desirable to use one or more of the 

Pan5, Pan6, Pan7, SV1, SV25 and/or SV39 gene products (e.g., a capsid protein or a 
fragment thereof) to generate an antibody. The term "an antibody," as used herein, refers 
to an immunoglobulin molecule which is able to specifically bind to an epitope. Thus, 
the antibodies of the invention bind, preferably specifically and without cross-reactivity, 

25 to a Pan5, Pan6, Pan7, SV1, SV25 or SV39 epitope. The antibodies in the present 
invention exist in a variety of forms including, for example, high affinity polyclonal 
antibodies, monoclonal antibodies, synthetic antibodies, chimeric antibodies, recombinant 
antibodies and humanized antibodies. Such antibodies originate from immunoglobulin 
classes IgG, IgM, IgA, IgD and IgE. 

30 Such antibodies may be generated using any of a number of methods 

know in the art. Suitable antibodies may be generated by well-known conventional 
techniques, e.g. Kohler and Milstein and the many known modifications thereof. 
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Similarly desirable high titer antibodies are generated by applying known recombinant 
techniques to the monoclonal or polyclonal antibodies developed to these antigens [see, 
e.g., PCT Patent Application No. PCT/GB85/00392; British Patent Application 
Publication No. GB2188638A; Amit et al, 1986 Science, 233:141-153; Queen et al, 
5 1989 Proc. Natl Acad Sci. USA, 86:\ 0029-10033; PCT Patent Application No. 

PCT/WO9007861; and Riechmann etal, Nature, 332:323-321 (1988); Huse etal, 1988a 
Science, 246:1275-1281]. Alternatively, antibodies can be produced by manipulating the 
complementarity determining regions of animal or human antibodies to the antigen of this 
invention. See, e.g., E. Mark and Padlin, " Human ization of Monoclonal Antibodies", 

10 Chapter 4, The Handbook of Experimental Pharmacology, Vol. 113, The Pharmacology 
of Monoclonal Antibodies, Springer- Verlag (June, 1994);, Harlow et al, 1999, Using 
Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, NY; Harlow et 
al, 1989, Antibodies: A Laboratory Manual, Cold Spring Harbor, New York; Houston et 
al, 1988, Proc. Natl Acad. Set USA 55:5879-5883; and Bird et al, 1988, Science 

15 242:423-426. Further provided by the present invention are anti-idiotype antibodies 
(Ab2) and anti-anti-idiotype antibodies (Ab3). See, e.g., M. Wettendorff et al., 
"Modulation of anti-tumor immunity by anti-id iotypic antibodies. 11 In Idiotypic Network 
and Diseases, ed. by J. Cerny and J. Hiernaux, 1990 J. Am. Soc. Microbiol, Washington 
DC: pp. 203-229]. These anti-idiotype and anti-anti-idiotype antibodies are produced 

20 using techniques well known to those of skill in the art. These antibodies may be used for 
a variety of purposes, including diagnostic and clinical methods and kits. 

Under certain circumstances, it may be desirable to introduce a detectable 
label or a tag onto a Pan5, Pan6, Pan7, SV1, SV25 or SV39 gene product, antibody or 
other construct of the invention. As used herein, a detectable label is a molecule which is 

25 capable, alone or upon interaction with another molecule, of providing a detectable signal. 
Most desirably, the label is detectable visually, e.g. by fluorescence, for ready use in 
immunohistochemical analyses or immunofluorescent microscopy. For example, suitable 
labels include fluorescein isothiocyanate (FITC), phycoerythrin (PE), allophycocyanin 
(APC), coriphosphine-0 (CPO) or tandem dyes, PE-cyanin-5 (PCS), and PE-Texas Red 

30 (ECD). All of these fluorescent dyes are commercially available, and their uses known to 
the art. Other useful labels include a colloidal gold label. Still other useful labels include 
radioactive compounds or elements. Additionally, labels include a variety of enzyme 
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systems that operate to reveal a colorimetric signal in an assay, e.g., glucose oxidase 
(which uses glucose as a substrate) releases peroxide as a product which in the presence 
of peroxidase and a hydrogen donor such as tetramethyl benzidine (TMB) produces an 
oxidized TMB that is seen as a blue color. Other examples include horseradish 
5 peroxidase (HRP) or alkaline phosphatase (AP), and hexokinase in conjunction with 

glucose-6-phosphate dehydrogenase which reacts with ATP, glucose, and NAD+ to yield, 
among other products, NADH that is detected as increased absorbance at 340 nm 
wavelength. 

Other label systems that are utilized in the methods of this invention are 

10 detectable by other means, e.g., colored latex microparticles [Bangs Laboratories, 

Indiana] in which a dye is embedded are used in place of enzymes to form conjugates 
with the target sequences provide a visual signal indicative of the presence of the 
resulting complex in applicable assays. 

Methods for coupling or associating the label with a desired molecule are 

15 similarly conventional and known to those of skill in the art. Known methods of label 
attachment are described [see, for example, Handbook of Fluorescent probes and 
Research Chemicals, 6th Ed., R. P. M. Haugland, Molecular Probes, Inc., Eugene, OR, 
1996; Pierce Catalog and Handbook, Life Science and Analytical Research Products, 
Pierce Chemical Company, Rockford, IL, 1994/1995]. Thus, selection of the label and 

20 coupling methods do not limit this invention. 

The sequences, proteins, and fragments of the invention may be produced 
by any suitable means, including recombinant production, chemical synthesis, or other 
synthetic means. Suitable production techniques are well known to those of skill in the 
art. See, e.g., Sambrook et al, Molecular Cloning: A Laboratory Manual, Cold Spring 

25 Harbor Press (Cold Spring Harbor, NY). Alternatively, peptides can also be synthesized 
by the well known solid phase peptide synthesis methods (Merrifield, J. Am. Chem. Soc, 
85:2149 (1962); Stewart and Young, Solid Phase Peptide Synthesis (Freeman, San 
Francisco, 1969) pp. 27-62). These and other suitable production methods are within the 
knowledge of those of skill in the art and are not a limitation of the present invention. 

30 In addition, one of skill in the art will readily understand that the Ad 

sequences of the invention can be readily adapted for use for a variety of viral and non- 
viral vector systems for in vitro, ex vivo or in vivo delivery of therapeutic and 
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immunogenic molecules. For example, in one embodiment, the simian Ad capsid 
proteins and other simian adenovirus proteins described herein are used for non-viral, 
protein-based delivery of genes, proteins, and other desirable diagnostic, therapeutic and 
immunogenic molecules. In one such embodiment, a protein of the invention is linked, 
5 directly or indirectly, to a molecule for targeting to cells with a receptor for adenoviruses. 
Preferably, a capsid protein such as a hexon, penton, fiber or a fragment thereof having a 
ligand for a cell surface receptor is selected for such targeting. Suitable molecules for 
delivery are selected from among the therapeutic molecules described herein and their 
gene products. A variety of linkers including, lipids, polyLys, and the like may be 

10 utilized as linkers. For example, the simian penton protein may be readily utilized for 
such a purpose by production of a fusion protein using the simian penton sequences in a 
manner analogous to that described in Medina-Kauwe LK, et al, Gene Ther. 2001 May; 
8(10):795-803 and Medina-Kauwe LK, et al, Gene Tlier. 2001 Dec; 8(23): 1753-1761. 
Alternatively, the amino acid sequences of simian Ad protein IX may be utilized for 

15 targeting vectors to a cell surface receptor, as described in US Patent Appln 

20010047081. Suitable ligands include a CD40 antigen, an RGD-containing or 
polylysine-containing sequence, and the like. Still other simian Ad proteins, including, 
e.g., the hexon protein and/or the fiber protein, may be used for used for these and similar 
purposes. 

20 Still other adenoviral proteins of the invention may be used as alone, or in 

combination with other adenoviral protein, for a variety of purposes which will be readily 
apparent to one of skill in the art. In addition, still other uses for the adenoviral proteins 
of the invention will be readily apparent to one of skill in the art. 

25 EL Recombinant Adenoviral Vectors 

The compositions of this invention include vectors that deliver a heterologous 
molecule to cells, either for therapeutic or vaccine purposes. As used herein, a vector 
may include any genetic element including, without limitation, naked DNA, a phage, 
transposon, cosmid, episome, plasmid, or a virus. Such vectors contain simian 

30 adenovirus DNA of Pan5, Pan6, Pan 7, SV1, SV25 and/or SV39 and a minigene. By 
"minigene" is meant the combination of a selected heterologous gene and the other 
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regulatory elements necessary to drive translation, transcription and/or expression of the 
gene product in a host cell. 

Typically, an adenoviral vector of the invention is designed such that the 
minigene is located in a nucleic acid molecule which contains other adenoviral sequences 
5 in the region native to a selected adenoviral gene. The minigene may be inserted into an 
existing gene region to disrupt the function of that region, if desired. Alternatively, the 
minigene may be inserted into the site of a partially or fully deleted adenoviral gene. For 
example, the minigene may be located in the site of such as the site of a functional El 
deletion or functional E3 deletion, among others that may be selected. The term 
10 "functionally deleted" or "functional deletion" means that a sufficient amount of the gene 
region is removed or otherwise damaged, e.g., by mutation or modification, so that the 
gene region is no longer capable of producing functional products of gene expression. If 
desired, the entire gene region may be removed. Other suitable sites for gene disruption or 
deletion are discussed elsewhere in the application. 
15 For example, for a production vector useful for generation of a recombinant virus, 

the vector may contain the minigene and either the 5' end of the adenoviral genome or the 
3 5 end of the adenoviral genome, or both the 5' and 3' ends of the adenoviral genome. 
The 5' end of the adenoviral genome contains the 5' cis-elements necessary for packaging 
and replication; i.e., the 5' inverted terminal repeat (ITR) sequences (which functions as 
2 0 origins of replication) and the native 5' packaging enhancer domains (that contain 

sequences necessary for packaging linear Ad genomes and enhancer elements for the El 
promoter). The 3' end of the adenoviral genome includes the V cis-elements (including 
the ITRs) necessary for packaging and encapsidation. Suitably, a recombinant adenovirus 
contains both 5 ? and 3' adenoviral cis-elements and the minigene is located between the 
5 9 and 3* adenoviral sequences. Any adenoviral vector of the invention may also contain 
additional adenoviral sequences. 

Suitably, these adenoviral vectors of the invention contain one or more adenoviral 
elements derived from an adenoviral genome of the invention. In one embodiment, the 
vectors contain adenoviral ITRs from Pan5, Pan6, Pan7, SV1, SV25 or SV39 and 
additional adenoviral sequences from the same adenoviral serotype. In another 
embodiment, the vectors contain adenoviral sequences that are derived from a different 
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adenoviral serotype than that which provides the ITRs. As defined herein, a pseudotyped 
adenovirus refers to an adenovirus in which the capsid protein of the adenovirus is from a 
different serotype than the serotype which provides the ITRs. The selection of the 
serotype of the ITRs and the serotype of any other adenoviral sequences present in vector 
5 is not a limitation of the present invention. A variety of adenovirus strains are available 
from the American Type Culture Collection, Manassas, Virginia, or available by request 
from a variety of commercial and institutional sources. Further, the sequences of many 
such strains are available from a variety of databases including, e.g., PubMed and 
GenBank. Homologous adenovirus vectors prepared from other simian or from human 

10 adenoviruses are described in the published literature [see, for example, US Patent No. 
5,240,846]. The DNA sequences of a number of adenovirus types are available from 
GenBank, including type Ad5 [GenBank Accession No. M73260]. The adenovirus 
sequences may be obtained from any known adenovirus serotype, such as serotypes 2, 3, 
4, 7, 12 and 40, and further including any of the presently identified human types. 

15 Similarly adenoviruses known to infect non-human animals (e.g., simians) may also be 
employed in the vector constructs of this invention. See, e.g., US Patent No. 6,083,716. 

The viral sequences, helper viruses, if needed, and recombinant viral particles, 
and other vector components and sequences employed in the construction of the vectors 
described herein are obtained as described above. The DNA sequences of the Pan5, Pan6, 

20 Pan7, SV1, SV25 and/or SV39 simian adenovirus sequences of the invention are 

employed to construct vectors and cell lines useful in the preparation of such vectors. 

Modifications of the nucleic acid sequences forming the vectors of this invention, 
including sequence deletions, insertions, and other mutations may be generated using 
standard molecular biological techniques and are within the scope of this invention. 

25 A The "Minigene" 

The methods employed for the selection of the transgene, the cloning and 
construction of the "minigene" and its insertion into the viral vector are within the skill in 
the art given the teachings provided herein. 

1. The transgene 

30 The transgene is a nucleic acid sequence, heterologous to the 

vector sequences flanking the transgene, which encodes a polypeptide, protein, or other 
product, of interest. The nucleic acid coding sequence is operatively linked to regulatory 
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components in a manner which permits transgene transcription, translation, and/or 
expression in a host cell. 

The composition of the transgene sequence will depend upon the 
use to which the resulting vector will be put. For example, one type of transgene 
5 sequence includes a reporter sequence, which upon expression produces a detectable 
signal. Such reporter sequences include, without limitation, DNA sequences encoding 0- 
lactamase, p-galactosidase (LacZ), alkaline phosphatase, thymidine kinase, green 
fluorescent protein (GFP), chloramphenicol acetyltransferase (CAT), luciferase, 
membrane bound proteins including, for example, CD2, CD4, CD8, the influenza 

10 hemagglutinin protein, and others well known in the art, to which high affinity antibodies 
directed thereto exist or can be produced by conventional means, and fusion proteins 
comprising a membrane bound protein appropriately fused to an antigen tag domain from, 
among others, hemagglutinin or Myc. These coding sequences, when associated with 
regulatory elements which drive their expression, provide signals detectable by 

15 conventional means, including enzymatic, radiographic, colorimetric, fluorescence or 
other spectrograph ic assays, fluorescent activating cell sorting assays and immunological 
assays, including enzyme linked immunosorbent assay (ELISA), radioimmunoassay 
(RIA) and immunohistochemistry. For example, where the marker sequence is the LacZ 
gene, the presence of the vector carrying the signal is detected by assays for beta- 

20 galactosidase activity. Where the transgene is GFP or luciferase, the vector carrying the 
signal may be measured visually by color or light production in a luminometer. 

However, desirably, the transgene is a non-marker sequence encoding a 
product which is useful in biology and medicine, such as proteins, peptides, RNA, 
enzymes, or catalytic RNAs. Desirable RNA molecules include tRNA, dsRNA, 

25 ribosomal RNA, catalytic RNAs, and antisense RNAs. One example of a useful RNA 
sequence is a sequence which extinguishes expression of a targeted nucleic acid sequence 
in the treated animal. 

The transgene may be used for treatment, e.g., of genetic deficiencies, as a 
cancer therapeutic or vaccine, for induction of an immune response, and/or for 

30 prophylactic vaccine purposes. As used herein, induction of an immune response refers 
to the ability of a molecule (e.g., a gene product) to induce a T cell and/or a humoral 
immune response to the molecule. The invention further includes using multiple 
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transgenes, e.g., to correct or ameliorate a condition caused by a multi-subunit protein. In 
certain situations, a different transgene may be used to encode each subunit of a protein, 
or to encode different peptides or proteins. This is desirable when the size of the DNA 
encoding the protein subunit is large, e.g., for an immunoglobulin, the platelet-derived 
5 growth factor, or a dystrophin protein. In order for the cell to produce the multi-subunit 
protein, a cell is infected with the recombinant virus containing each of the different 
subunits. Alternatively, different subunits of a protein may be encoded by the same 
transgene. In this case, a single transgene includes the DNA encoding each of the 
subunits, with the DNA for each subunit separated by an internal ribozyme entry site 

10 (IRES). This is desirable when the size of the DNA encoding each of the subunits is 

small, e.g., the total size of the DNA encoding the subunits and the IRES is less than five 
kilobases. As an alternative to an IRES, the DNA may be separated by sequences 
encoding a 2A peptide, which self-cleaves in a post-translational event. See, e.g., M.L. 
Donnelly, et al, J. Gen. Virol, 78(Pt 1):13-21 (Jan 1997); Furler, S., etal, Gene Then, 

15 8(ll):864-873 (June 2001); Klump H., etal, Gene Ther., 8(10):81 1-817 (May 2001). 
This 2A peptide is significantly smaller than an IRES, making it well suited for use when 
space is a limiting factor. However, the selected transgene may encode any biologically 
active product or other product, e.g., a product desirable for study. 

Suitable transgenes may be readily selected by one of skill in the art. The 

20 selection of the transgene is not considered to be a limitation of this invention. 
2. Regulatory Elements 

In addition to the major elements identified above for the 
minigene, the vector also includes conventional control elements necessary which are 
operably linked to the transgene in a manner that permits its transcription, translation 

25 and/or expression in a cell transfected with the plasmid vector or infected with the virus 
produced by the invention. As used herein, "operably linked" sequences include both 
expression control sequences that are contiguous with the gene of interest and expression 
control sequences that act in trans or at a distance to control the gene of interest. 

Expression control sequences include appropriate transcription 

30 initiation, termination, promoter and enhancer sequences; efficient RNA processing 
signals such as splicing and polyadenylation (polyA) signals; sequences that stabilize 
cytoplasmic mRNA; sequences that enhance translation efficiency (i.e., Kozak consensus 
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sequence); sequences that enhance protein stability; and when desired, sequences that 
enhance secretion of the encoded product. A great number of expression control 
sequences, including promoters which are native, constitutive, inducible and/or tissue- 
specific, are known in the art and may be utilized. 
5 Examples of constitutive promoters include, without limitation, 

the retroviral Rous sarcoma virus (RSV) LTR promoter (optionally with the RSV 
enhancer), the cytomegalovirus (CMV) promoter (optionally with the CMV enhancer) 
[see, e.g., Boshart etal, Cell, 41:521-530 (1985)], the SV40 promoter, the dihydrofolate 
reductase promoter, the P-actin promoter, the phosphoglycerol kinase (PGK) promoter, 

10 and the EFla promoter [Invitrogen]. 

Inducible promoters allow regulation of gene expression and can 
be regulated by exogenously supplied compounds, environmental factors such as 
temperature, or the presence of a specific physiological state, e.g., acute phase, a 
particular differentiation state of the cell, or in replicating cells only. Inducible 

15 promoters and inducible systems are available from a variety of commercial sources, 

including, without limitation, Invitrogen, Clontech and Ariad. Many other systems have 
been described and can be readily selected by one of skill in the art. For example, 
inducible promoters include the zinc-inducible sheep metallothionine (MT) promoter and 
the dexamethasone (Dex)-inducible mouse mammary tumor virus (MMTV) promoter. 

20 Other inducible systems include the T7 polymerase promoter system [WO 98/10088]; the 
ecdysone insect promoter [No et al, Proc. Natl Acad. Sci. USA, 93:3346-3351 (1996)], 
the tetracycline-repressible system [Gossen et al, Proc. Natl. Acad. Sci. USA, 89:5547- 
5551 (1992)], the tetracycline-inducible system [Gossen et al, Science, 268:1766-1769 
(1995), see also Harvey et al, Curr. Opin. Chem. Biol, 2:512-518 (1998)]. Other systems 

25 include the FK506 dimer, VP 16 or p65 using castradiol, diphenol murislerone, the 

RU486-inducible system [Wang et al, Nat. Biotech, 15:239-243 (1 997) and Wang et al, 
Gene Tlier., 4:432-441 (1997)] and the rapamycin-inducible system [Magari et al, J. Clin. 
Invest, 100:2865-2872 (1997)]. The effectiveness of some inducible promoters 
increases over time. In such cases one can enhance the effectiveness of such systems by 

30 inserting multiple repressors in tandem, e.g., TetR linked to a TetR by an IRES. 

Alternatively, one can wait at least 3 days before screening for the desired function. One 
can enhance expression of desired proteins by known means to enhance the effectiveness 
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of this system. For example, using the Woodchuck Hepatitis Virus Posttranscriptional 
Regulatory Element (WPRE). 

In another embodiment, the native promoter for the transgene will 
be used. The native promoter may be preferred when it is desired that expression of the 
5 transgene should mimic the native expression. The native promoter may be used when 
expression of the transgene must be regulated temporally or developmental^, or in a 
tissue-specific manner, or in response to specific transcriptional stimuli. In a further 
embodiment, other native expression control elements, such as enhancer elements, 
polyadenylation sites or Kozak consensus sequences may also be used to mimic the 
10 native expression. 

Another embodiment of the transgene includes a transgene 
operably linked to a tissue-specific promoter. For instance, if expression in skeletal 
muscle is desired, a promoter active in muscle should be used. These include the 
promoters from genes encoding skeletal p-actin, myosin light chain 2A, dystrophin, 

15 muscle creatine kinase, as well as synthetic muscle promoters with activities higher than 
naturally occurring promoters (see Li et al, Nat Biotech, 17:241-245 (1999)). Examples 
of promoters that are tissue-specific are known for liver (albumin, Miyatake et al, J. 
Virol, 71:5124-32 (1997); hepatitis B virus core promoter, Sandig et al, Gene Ther., 
3:1002-9 (1996); alpha-fetoprotein (AFP), Arbuthnot et al, Hum. Gene Ther., 7:1503-14 

20 (1996)), bone osteocalcin (Stein etal, Mol Biol Rep., 24:185-96 (1997)); bone 

sialoprotein (Chen et al, J. Bone Miner. Res., 11:654-64 (1996)), lymphocytes (CD2, 
Hansal etal., J. Immunol, 161:1063-8 (1998); immunoglobulin heavy chain; T cell 
receptor chain), neuronal such as neuron-specific enolase (NSE) promoter (Andersen et 
al, Cell Mol Neurobiol, 13:503-15 (1993)), neurofilament light-chain gene (Piccioli et 

25 al, Proc. Natl Acad. Set USA, 88:561 1-5 (1991)), and the neuron-specific vgf gene 
(Piccioli et al, Neuron, 15:373-84 (1 995)), among others. 

Optionally, vectors carrying transgenes encoding therapeutically useful or 
immunogenic products may also include selectable markers or reporter genes may include 
sequences encoding geneticin, hygromicin or purimycin resistance, among others. Such 

30 selectable reporters or marker genes (preferably located outside the viral genome to be 
packaged into a viral particle) can be used to signal the presence of the plasmids in 
bacterial cells, such as ampicillin resistance. Other components of the vector may include 
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an origin of replication. Selection of these and other promoters and vector elements are 
conventional and many such sequences are available [see, e.g., Sambrook et al, and 
references cited therein]. 

These vectors are generated using the techniques and sequences provided 
5 herein, in conjunction with techniques known to those of skill in the art. 

Such techniques include conventional cloning techniques of cDNA such as those 
described in texts [Sambrook et al, Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor Press, Cold Spring Harbor, NY], use of overlapping oligonucleotide 
sequences of the adenovirus genomes, polymerase chain reaction, and any suitable 
10 method which provides the desired nucleotide sequence. 

HI. Production of the Recombinant Viral Particle 

In one embodiment, the simian adenoviral plasmids (or other vectors) are used to 
produce recombinant adenoviral particles. In one embodiment, the recombinant 

15 adenoviruses are functionally deleted in the Ela or Elb genes, and optionally bearing 
other mutations, e.g., temperature-sensitive mutations or deletions in other genes. In 
other embodiments, it is desirable to retain an intact Ela and/or Elb region in the 
recombinant adenoviruses. Such an intact El region may be located in its native location 
in the adenoviral genome or placed in the site of a deletion in the native adenoviral 

20 genome (e.g., in the E3 region). 

In the construction of useful simian adenovirus vectors for delivery of a gene to 
the human (or other mammalian) cell, a range of adenovirus nucleic acid sequences can 
be employed in the vectors. For example, all or a portion of the adenovirus delayed early 
gene E3 may be eliminated from the simian adenovirus sequence which forms a part of 

25 the recombinant virus. The function of simian E3 is believed to be irrelevant to the 
function and production of the recombinant virus particle. Simian adenovirus vectors 
may also be constructed having a deletion of at least the ORF6 region of the E4 gene, and 
more desirably because of the redundancy in the function of this region, the entire E4 
region. Still another vector of this invention contains a deletion in the delayed early gene 

30 E2a. Deletions may also be made in any of the late genes LI through L5 of the simian 
adenovirus genome. Similarly, deletions in the intermediate genes IX and IVa 2 may be 
useful for some purposes. Other deletions may be made in the other structural or non- 
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structural adenovirus genes. The above discussed deletions may be used individually, 
i.e., an adenovirus sequence for use in the present invention may contain deletions in only 
a single region. Alternatively, deletions of entire genes or portions thereof effective to 
destroy their biological activity may be used in any combination. For example, in one 
5 exemplary vector, the adenovirus sequence may have deletions of the El genes and the 
E4 gene, or of the El, E2a and E3 genes, or of the El and E3 genes, or of El, E2a and E4 
genes, with or without deletion of E3, and so on. As discussed above, such deletions may 
be used in combination with other mutations, such as temperature-sensitive mutations, to 
achieve a desired result. 

10 An adenoviral vector lacking any essential adenoviral sequences (e.g., Ela, El b, 

E2a, E2b, E4 ORF6, LI, L2, L3, L4 and L5) may be cultured in the presence of the 
missing adenoviral gene products which are required for viral infectivity and propagation 
of an adenoviral particle. These helper functions may be provided by culturing the 
adenoviral vector in the presence of one or more helper constructs (e.g., a plasmid or 

15 virus) or a packaging host cell. See, for example, the techniques described for preparation 
of a "minimal" human Ad vector in International Patent Application W096/13597, 
published May 9, 1996, and incorporated herein by reference. 
1. Helper Viruses 

Thus, depending upon the simian adenovirus gene content of the 

20 viral vectors employed to cany the minigene, a helper adenovirus or non-replicating virus 
fragment may be necessary to provide sufficient simian adenovirus gene sequences 
necessary to produce an infective recombinant viral particle containing the minigene. 
Useful helper viruses contain selected adenovirus gene sequences not present in the 
adenovirus vector construct and/or not expressed by the packaging cell line in which the 

25 vector is transfected. In one embodiment, the helper virus is replication-defective and 
contains a variety of adenovirus genes in addition to the sequences described above. 
Such a helper virus is desirably used in combination with an El -expressing cell line. 

Helper viruses may also be formed into poly-cation conjugates as 
described in Wu et al, J. Biol. Chem., 264:16985-16987 (1989); K. J. Fisher and J. M. 

30 Wilson, Biochem. J., 299:49 (April 1, 1994). Helper virus may optionally contain a 
second reporter minigene. A number of such reporter genes are known to the art. The 
presence of a reporter gene on the helper virus which is different from the transgene on 
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the adenovirus vector allows both the Ad vector and the helper virus to be independently 
monitored. This second reporter is used to enable separation between the resulting 
recombinant virus and the helper virus upon purification. 

2. Complementation Cell Lines 
5 To generate recombinant simian adenoviruses (Ad) deleted in any 

of the genes described above, the function of the deleted gene region, if essential to the 
replication and infectivity of the virus, must be supplied to the recombinant virus by a 
helper virus or cell line, i.e., a complementation or packaging cell line. In many 
circumstances, a cell line expressing the human El can be used to transcomplement the 

10 chimp Ad vector. This is particularly advantageous because, due to the diversity between 
the chimp Ad sequences of the invention and the human AdEl sequences found in 
currently available packaging cells, the use of the current human El -containing cells 
prevents the generation of replication-competent adenoviruses during the replication and 
production process. However, in certain circumstances, it will be desirable to utilize a 

15 cell line which expresses the El gene products can be utilized for production of an El- 
deleted simian adenovirus. Such cell lines have been described. See, e.g., US Patent 
6,083,716. 

If desired, one may utilize the sequences provided herein to 
generate a packaging cell or cell line that expresses, at a minimum, the adenovirus El 

20 gene from Pan5, Pan6, Pan7, SV1, SV25 or SV39 under the transcriptional control of a 
promoter for expression in a selected parent cell line. Inducible or constitutive promoters 
may be employed for this purpose. Examples of such promoters are described in detail 
elsewhere in this specification. A parent cell is selected for the generation of a novel cell 
line expressing any desired AdPanS, Pan6, Pan7, SV1, SV25 or SV39 gene. Without 

25 limitation, such a parent cell line may be HeLa [ATCC Accession No. CCL 2], A549 
[ATCC Accession No. CCL 185], HEK 293, KB [CCL 17], Detroit [e.g, Detroit 510, 
CCL 72] and WI-38 [CCL 75] cells, among others. These cell lines are all available from 
the American Type Culture Collection, 10801 University Boulevard, Manassas, Virginia 
201 10-2209. Other suitable parent cell lines may be obtained from other sources. 

30 Such El -expressing cell lines are useful in the generation of 

recombinant simian adenovirus El deleted vectors. Additionally, or alternatively, the 
invention provides cell lines that express one or more simian adenoviral gene products, 
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e.g., El a, Elb 5 E2a, and/or E4 ORF6, can be constructed using essentially the same 
procedures for use in the generation of recombinant simian viral vectors. Such cell lines 
can be utilized to transcomplement adenovirus vectors deleted in the essential genes that 
encode those products, or to provide helper functions necessary for packaging of a helper- 
5 dependent virus (e.g., adeno-associated virus). The preparation of a host cell according to 
this invention involves techniques such as assembly of selected DNA sequences. This 
assembly may be accomplished utilizing conventional techniques. Such techniques 
include cDNAand genomic cloning, which are well known and are described in 
Sambrook et al., cited above, use of overlapping oligonucleotide sequences of the 

10 adenovirus genomes, combined with polymerase chain reaction, synthetic methods, and 
any other suitable methods which provide the desired nucleotide sequence. 

In still another alternative, the essential adenoviral gene products 
are provided in trans by the adenoviral vector and/or helper virus. In such an instance, a 
suitable host cell can be selected from any biological organism, including prokaryotic 

15 (e.g., bacterial) cells, and eukaryotic cells, including, insect cells, yeast cells and 
mammalian cells. Particularly desirable host cells are selected from among any 
mammalian species, including, without limitation, cells such as A549, WEHI, 3T3, 
10T1/2, HEK 293 cells or PERC6 (both of which express functional adenoviral El) 
[Fallaux, FJ et al, (1998), Hum Gene Ther, 9:1909-1917], Saos, C2C12, L cells, HT1 080, 

20 HepG2 and primary fibroblast, hepatocyte and myoblast cells derived from mammals 
including human, monkey, mouse, rat, rabbit, and hamster. The selection of the 
mammalian species providing the cells is not a limitation of this invention; nor is the type 
of mammalian cell, i.e., fibroblast hepatocyte, tumor cell, etc. 

3 . Assembly of Viral Particle and Transfection of a Cell Line 

25 Generally, when delivering the vector comprising the minigene by 

transfection, the vector is delivered in an amount from about 5 jig to about 100 jig DNA, 
and preferably about 10 to about 50 jig DNA to about 1 x 10 4 cells to about 1 x 10 13 cells, 
and preferably about 10 5 cells. However, the relative amounts of vector DNA to host 
cells may be adjusted, taking into consideration such factors as the selected vector, the 

30 delivery method and the host cells selected. 

The vector may be any vector known in the art or disclosed above, 
including naked DNA, a plasmid, phage, transposon, cosmids, episomes, viruses, etc. 
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Introduction into the host cell of the vector may be achieved by any means known in the 
art or as disclosed above, including transfection, and infection. One or more of the 
adenoviral genes may be stably integrated into the genome of the host cell, stably 
expressed as episomes, or expressed transiently. The gene products may all be expressed 
5 transiently, on an episome or stably integrated, or some of the gene products may be 
expressed stably while others are expressed transiently. Furthermore, the promoters for 
each of the adenoviral genes may be selected independently from a constitutive promoter, 
an inducible promoter or a native adenoviral promoter. The promoters may be regulated 
by a specific physiological state of the organism or cell (i.e., by the differentiation state or 

10 in replicating or quiescent cells) or by exogenously-added factors, for example. 

Introduction of the molecules (as plasmids or viruses) into the host 
cell may also be accomplished using techniques known to the skilled artisan and as 
discussed throughout the specification. In preferred embodiment, standard transfection 
techniques are used, e.g., CaP04 transfection or electroporation. 

15 Assembly of the selected DNA sequences of the adenovirus (as 

well as the transgene and other vector elements into various intermediate plasmids, and 
the use of the plasmids and vectors to produce a recombinant viral particle are all 
achieved using conventional techniques. Such techniques include conventional cloning 
techniques of cDNA such as those described in texts [Sambrook et al, cited above], use of 

20 overlapping oligonucleotide sequences of the adenovirus genomes, polymerase chain 
reaction, and any suitable method which provides the desired nucleotide sequence. 
Standard transfection and co-transfection techniques are employed, e.g., CaP0 4 
precipitation techniques. Other conventional methods employed include homologous 
recombination of the viral genomes, plaquing of viruses in agar overlay, methods of 

25 measuring signal generation, and the like. 

For example, following the construction and assembly of the desired 
minigene-containing viral vector, the vector is transfected in vitro in the presence of a 
helper virus into the packaging cell line. Homologous recombination occurs between the 
helper and the vector sequences, which permits the adenovirus-transgene sequences in the 

30 vector to be replicated and packaged into virion capsids, resulting in the recombinant viral 
vector particles. The current method for producing such virus particles is transfection- 
based. However, the invention is not limited to such methods. 
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The resulting recombinant simian adenoviruses are useful in transferring a 
selected transgene to a selected cell. In in vivo experiments with the recombinant virus 
grown in the packaging cell lines, the El -deleted recombinant simian adenoviral vectors 
of the invention demonstrate utility in transferring a transgene to a non-simian, preferably 
5 a human, cell. 

IV. Use of the Recombinant Adenovirus Vectors 

The recombinant simian adenovirus vectors of the invention are useful for gene 
transfer to a human or non-simian veterinary patient in vitro, ex vivo, and in vivo. 

10 The recombinant adenovirus vectors described herein can be used as expression 

vectors for the production of the products encoded by the heterologous genes in vitro. 
For example, the recombinant adenoviruses containing a gene inserted into the location of 
an El deletion may be transfected into an El -expressing cell line as described above. 
Alternatively, replication-competent adenoviruses may be used in another selected cell 

15 line. The transfected cells are then cultured in the conventional manner, allowing the 
recombinant adenovirus to express the gene product from the promoter. The gene 
product may then be recovered from the culture medium by known conventional methods 
of protein isolation and recovery from culture. 

A Pan5, Pan6, Pan7, SV1, SV25 or SV39-derived recombinant simian adenoviral 

20 vector of the invention provides an efficient gene transfer vehicle that can deliver a 

selected transgene to a selected host cell in vivo or ex vivo even where the organism has 
neutralizing antibodies to one or more AAV serotypes. In one embodiment, the rAAV 
and the cells are mixed ex vivo; the infected cells are cultured using conventional 
methodologies; and the transduced cells are re-infused into the patient. These 

25 compositions are particularly well suited to gene delivery for therapeutic purposes and for 
immunization, including inducing protective immunity. 

More commonly, the Pan 5, Pan6, Pan7, SV1, SV25, or SV39 recombinant 
adenoviral vectors of the invention will be utilized for delivery of therapeutic or 
immunogenic molecules, as described below. It will be readily understood for both 

30 applications, that the recombinant adenoviral vectors of the invention are particularly well 
suited for use in regimens involving repeat delivery of recombinant adenoviral vectors. 
Such regimens typically involve delivery of a series of viral vectors in which the viral 
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capsids are alternated. The viral capsids may be changed for each subsequent 
administration, or after a pre-selected number of administrations of a particular serotype 
capsid (e.g., one, two, three, four or more). Thus, a regimen may involve delivery of a 
rAd with a first simian capsid, delivery with a rAd with a second simian capsid, and 
5 delivery with a third simian capsid. A variety of other regimens which use the Ad capsids 
of the invention alone, in combination with one another, or in combination with other Ad 
serotypes will be apparent to those of skill in the art. Optionally, such a regimen may 
involve administration of rAd with capsids of other non-human primate adenoviruses, 
human adenoviruses, or artificial serotypes such as are described herein. Each phase of 
10 the regimen may involve administration of a series of injections (or other delivery routes) 
with a single Ad serotype capsid followed by a series with another Ad serotype capsid. 
Alternatively, the recombinant Ad vectors of the invention may be utilized in regimens 
involving other non-adenoviral-mediated delivery systems, including other viral systems, 
non-viral delivery systems, protein, peptides, and other biologically active molecules. 
15 The following sections will focus on exemplary molecules which may be 

delivered via the adenoviral vectors of the invention. 

A. Ad-Mediated Delivery of Therapeutic Molecules 

In one embodiment, the above-described recombinant vectors are 
administered to humans according to published methods for gene therapy. A simian viral 
20 vector bearing the selected transgene may be administered to a patient, preferably 

suspended in a biologically compatible solution or pharmaceutically acceptable delivery 
vehicle. A suitable vehicle includes sterile saline. Other aqueous and non-aqueous 
isotonic sterile injection solutions and aqueous and non-aqueous sterile suspensions 
known to be pharmaceutically acceptable carriers and well known to those of skill in the 
art may be employed for this purpose. 

The simian adenoviral vectors are administered in sufficient amounts to 
transduce the target cells and to provide sufficient levels of gene transfer and expression 
to provide a therapeutic benefit without undue adverse or with medically acceptable 
physiological effects, which can be determined by those skilled in the medical arts. 
Conventional and pharmaceutically acceptable routes of administration include, but are 
not limited to, direct delivery to the retina and other intraocular delivery methods, direct 
delivery to the liver, inhalation, intranasal, intravenous, intramuscular, intratracheal, 
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subcutaneous, intradermal, rectal, oral and other parenteral routes of administration. 
Routes of administration may be combined, if desired, or adjusted depending upon the 
transgene or the condition. The route of administration primarily will depend on the 
nature of the condition being treated. 
5 Dosages of the viral vector will depend primarily on factors such as the 

condition being treated, the age, weight and health of the patient, and may thus vary 
among patients. For example, a therapeutically effective adult human or veterinary 
dosage of the viral vector is generally in the range of from about 100 |iL to about 100 mL 
of a carrier containing concentrations of from about 1 x 10 6 to about 1 x 10 15 particles, 

10 about 1 x 10 n to 1 x 10 13 particles, or about 1 x 10 9 to lx 10 12 particles virus. Dosages 
will range depending upon the size of the animal and the route of administration. For 
example, a suitable human or veterinary dosage (for about an 80 kg animal) for 
intramuscular injection is in the range of about 1 x 10 9 to about 5 x 10 12 particles per mL, 
for a single site. Optionally, multiple sites of administration may be delivered. In another 

15 example, a suitable human or veterinary dosage may be in the range of about 1 x 10 u to 
about 1 x 10 15 particles for an oral formulation. One of skill in the art may adjust these 
doses, depending the route of administration, and the therapeutic or vaccinal application 
for which the recombinant vector is employed. The levels of expression of the transgene, 
or for an immunogen, the Level of circulating antibody, can be monitored to determine the 

20 frequency of dosage administration. Yet other methods for determining the timing of 
frequency of administration will be readily apparent to one of skill in the art. 

An optional method step involves the co-administration to the patient, 
either concurrently with, or before or after administration of the viral vector, of a suitable 
amount of a short acting immune modulator. The selected immune modulator is defined 

25 herein as an agent capable of inhibiting the formation of neutralizing antibodies directed 
against the recombinant vector of this invention or capable of inhibiting cytolytic T 
lymphocyte (CTL) elimination of the vector. The immune modulator may interfere with 
the interactions between the T helper subsets (T H i or Th2) and B cells to inhibit 
neutralizing antibody formation. Alternatively, the immune modulator may inhibit the 

30 interaction between T m cells and CTLs to reduce the occurrence of CTL elimination of 
the vector. A variety of useful immune modulators and dosages for use of same are 
disclosed, for example, in Yang et aL, J. Virol. > 70(9) (Sept., 1996); International Patent 
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Application No. WO96/12406, published May 2, 1996; and International Patent 
Application No.PCT/US96/03035, all incorporated herein by reference. 
1 . Therapeutic Transgenes 

Useful therapeutic products encoded by the transgene include 
5 hormones and growth and differentiation factors including, without limitation, insulin, 
glucagon, growth hormone (GH), parathyroid hormone (PTH), growth hormone releasing 
factor (GRF), follicle stimulating hormone (FSH), luteinizing hormone (LH), human 
chorionic gonadotropin (hCG), vascular endothelial growth factor (VEGF), angiopoietins, 
angiostatin, granulocyte colony stimulating factor (GCSF), erythropoietin (EPO), 

10 connective tissue growth factor (CTGF), basic fibroblast growth factor (bFGF), acidic 
fibroblast growth factor (aFGF), epidermal growth factor (EGF), transforming growth 
factor (TGF ) 9 platelet-derived growth factor (PDGF), insulin growth factors I and II 
(IGF-I and IGF-II), any one of the transforming growth factor superfamily, including 
TGF, activins, inhibins, or any of the bone morphogenic proteins (BMP) BMPs 1-15, any 

15 one of the heregluin/neuregulin/ARIA/neu differentiation factor (NDF) family of growth 
factors, nerve growth factor (NGF), brain-derived neurotrophic factor (BDNF), 
neurotrophins NT-3 and NT-4/5, ciliary neurotrophic factor (CNTF), glial cell line 
derived neurotrophic factor (GDNF), neurturin, agrin, any one of the family of 
semaphorins/collapsins, netrin-1 and netrin-2, hepatocyte growth factor (HGF), ephrins, 

20 noggin, sonic hedgehog and tyrosine hydroxylase. 

Other useful transgene products include proteins that regulate the 
immune system including, without limitation, cytokines and lymphokines such as 
thrombopoietin (TPO), interleukins (IL) IL-l through IL-25 (including, e.g., IL-2, IL-4, 
EL- 12 and IL-l 8), monocyte chemoattractant protein, leukemia inhibitory factor, 

25 granulocyte-macrophage colony stimulating factor, Fas ligand, tumor necrosis factors 
and, interferons, and, stem cell factor, flk-2/flt3 ligand. Gene products produced by the 
immune system are also useful in the invention. These include, without limitation, 
immunoglobulins IgG, IgM, IgA, IgD and IgE, chimeric immunoglobulins, humanized 
antibodies, single chain antibodies, T cell receptors, chimeric T cell receptors, single 

30 chain T cell receptors, class I and class II MHC molecules, as well as engineered 

immunoglobulins and MHC molecules. Useful gene products also include complement 
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regulatory proteins such as complement regulatory proteins, membrane cofactor protein 
(MCP), decay accelerating factor (DAF), CR1, CF2 and CD59. 

Still other useful gene products include any one of the receptors 
for the hormones, growth factors, cytokines, lymphokines, regulatory proteins and 
5 immune system proteins. The invention encompasses receptors for cholesterol regulation, 
including the low density lipoprotein (LDL) receptor, high density lipoprotein (HDL) 
receptor, the very low density lipoprotein (VLDL) receptor, and the scavenger receptor. 
The invention also encompasses gene products such as members of the steroid hormone 
receptor superfamily including glucocorticoid receptors and estrogen receptors, Vitamin 

10 D receptors and other nuclear receptors. In addition, useful gene products include 

transcription factors such asjunjos, max, mad, serum response factor (SRF), AP-1, AP2, 
myb, MyoD and myogenin, ETS-box containing proteins, TFE3, E2F, ATF1, ATF2, 
ATF3, ATF4, ZF5, NFAT, CREB, HNF-4, C/EBP, SP1, CCAAT-box binding proteins, 
interferon regulation factor (IRF-1), Wilms tumor protein, ETS-binding protein, STAT, 

15 GATA-box binding proteins, e.g., GATA-3, and the forkhead family of winged helix 
proteins. 

Other useful gene products include, carbamoyl synthetase I, 
ornithine transcarbamylase, arginosuccinate synthetase, arginosuccinate lyase, arginase, 
fumarylacetacetate hydrolase, phenylalanine hydroxylase, alpha- 1 antitrypsin, glucose-6- 
20 phosphatase, porphobilinogen deaminase, factor VIII, factor IX, cystathione beta- 
synthase, branched chain ketoacid decarboxylase, albumin, isovaleryl-coA 
dehydrogenase, propionyl CoA carboxylase, methyl malonyl Co A mutase, glutaryl Co A 
dehydrogenase, insulin, beta-glucosidase, pyruvate carboxylate, hepatic phosphorylase, 
phosphorylase kinase, glycine decarboxylase, H-protein, T-protein, a cystic fibrosis 
transmembrane regulator (CFTR) sequence, and a dystrophin cDNA sequence. 

Other useful gene products include non-naturally occurring 
polypeptides, such as chimeric or hybrid polypeptides having a non-naturally occurring 
amino acid sequence containing insertions, deletions or amino acid substitutions. For 
example, single-chain engineered immunoglobulins could be useful in certain 
immunocompromised patients. Other types of non-naturally occurring gene sequences 
include antisense molecules and catalytic nucleic acids, such as ribozymes, which could 
be used to reduce overexpression of a target. 
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Reduction and/or modulation of expression of a gene are 
particularly desirable for treatment of hyperproliferative conditions characterized by 
hyperproliferating cells, as are cancers and psoriasis. Target polypeptides include those 
polypeptides which are produced exclusively or at higher levels in hyperproliferative cells 
5 as compared to normal cells. Target antigens include polypeptides encoded by oncogenes 
such as myb, myc, fyn, and the translocation gene bcr/abl, ras, src, P53, neu, trie and 
EGRF. In addition to oncogene products as target antigens, target polypeptides for 
anti-cancer treatments and protective regimens include variable regions of antibodies 
made by B cell lymphomas and variable regions of T cell receptors of T cell lymphomas 

10 which, in some embodiments, are also used as target antigens for autoimmune disease. 
Other tumor-associated polypeptides can be used as target polypeptides such as 
polypeptides which are found at higher levels in tumor cells including the polypeptide 
recognized by monoclonal antibody 17-1 A and folate binding polypeptides. 

Other suitable therapeutic polypeptides and proteins include those 

15 which may be useful for treating individuals suffering from autoimmune diseases and 

disorders by conferring a broad based protective immune response against targets that are 
associated with autoimmunity including cell receptors and cells which produce self- 
directed antibodies. T cell mediated autoimmune diseases include Rheumatoid arthritis 
(RA), multiple sclerosis (MS), Sjogren's syndrome, sarcoidosis, insulin dependent 

20 diabetes mellitus (IDDM), autoimmune thyroiditis, reactive arthritis, ankylosing 

spondylitis, scleroderma, polymyositis, dermatomyositis, psoriasis, vasculitis, Wegener's 
granulomatosis, Crohn's disease and ulcerative colitis. Each of these diseases is 
characterized by T cell receptors (TCRs) that bind to endogenous antigens and initiate the 
inflammatory cascade associated with autoimmune diseases. 

25 The simian adenoviral vectors of the invention are particularly 

well suited for therapeutic regimens in which multiple adenoviral-mediated deliveries of 
transgenes is desired, e.g., in regimens involving redelivery of the same transgene or in 
combination regimens involving delivery of other transgenes. Such regimens may involve 
administration of a Pan5, Pan6, Pan7, SV1 , SV25 or SV39 simian adenoviral vector, 

30 followed by re-administration with a vector from the same serotype adenovirus. 

Particularly desirable regimens involve administration of a Pan5, Pan6, Pan7, SV1, SV25 
or SV39 simian adenoviral vector of the invention, in which the serotype of the viral 
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vector delivered in the first administration differs from the serotype of the viral vector 
utilized in one or more of the subsequent administrations. For example, a therapeutic 
regimen involves administration of a Pan5, Pan6, Pan7, SV1, SV25 or SV39 vector and 
repeat administration with one or more adenoviral vectors of the same or different 
5 serotypes. In another example, a therapeutic regimen involves administration of an 

adenoviral vector followed by repeat administration with a Pan5, Pan6, Pan7, SV1, SV25 
or SV39 vector of the invention which differs from the serotype of the first delivered 
adenoviral vector, and optionally further administration with another vector which is the 
same or, preferably, differs from the serotype of the vector in the prior administration 

10 steps. These regimens are not limited to delivery of adenoviral vectors constructed using 
the Pan5, Pan6, Pan7, SV1, SV25 or SV39 simian serotypes of the invention. Rather, 
these regimens can readily utilize vectors other adenoviral serotypes, including, without 
limitation, other simian adenoviral serotypes (e.g., Pan9 or C68, CI, etc), other non- 
human primate adenoviral serotypes, or human adenoviral serotypes, in combination with 

15 one or more of the Pan5, Pan6, Pan7, SV1, SV25 or SV39 vectors of the invention. 

Examples of such simian, other non-human primate and human adenoviral serotypes are 
discussed elsewhere in this document. Further, these therapeutic regimens may involve 
either simultaneous or sequential delivery of Pan 5, Pan6, Pan7, SV1, SV25, and/or SV39 
adenoviral vectors of the invention in combination with non-adenoviral vectors, non-viral 

20 vectors, and/or a variety of other therapeutically useful compounds or molecules. The 
present invention is not limited to these therapeutic regimens, a variety of which will be 
readily apparent to one of skill in the art. 

B. Ad-Mediated Delivery of Immunogenic Transgenes 

The recombinant simian adenoviruses may also be employed as 

25 immunogenic compositions. As used herein, an immunogenic composition is a 

composition to which a humoral (e.g., antibody) or cellular (e.g., a cytotoxic T cell) 
response is mounted to a transgene product delivered by the immunogenic composition 
following delivery to a mammal, and preferably a primate. The present invention 
provides a recombinant simian Ad that can contain in any of its adenovirus sequence 

30 deletions a gene encoding a desired immunogen. The simian adenovirus is likely to be 
better suited for use as a live recombinant virus vaccine in different animal species 
compared to an adenovirus of human origin, but is not limited to such a use. The 



44 



WO 03/046124 



PCT/US02/33645 



recombinant adenoviruses can be used as prophylactic or therapeutic vaccines against any 
pathogen for which the antigen(s) crucial for induction of an immune response and able to 
limit the spread of the pathogen has been identified and for which the cDNA is available. 

Such vaccinal (or other immunogenic) compositions are formulated in a 
5 suitable delivery vehicle, as described above. Generally, doses for the immunogenic 
compositions are in the range defined above for therapeutic compositions. The levels of 
immunity of the selected gene can be monitored to determine the need, if any, for 
boosters. Following an assessment of antibody titers in the serum, optional booster 
immunizations may be desired. 

10 Optionally, a vaccinal composition of the invention may be formulated to 

contain other components, including, e.g. adjuvants, stabilizers, pH adjusters, 
preservatives and the like. Such components are well known to those of skill in the 
vaccine art. Examples of suitable adjuvants include, without limitation, liposomes, alum, 
monophosphoryl lipid A, and any biologically active factor, such as cytokine, an 

15 interleukin, a chemokine, a ligands, and optimally combinations thereof. Certain of these 
biologically active factors can be expressed in vivo, e.g., via a plasmid or viral vector. 
For example, such an adjuvant can be administered with a priming DNA vaccine 
encoding an antigen to enhance the antigen-specific immune response compared with the 
immune response generated upon priming with a DNA vaccine encoding the antigen 

20 only. 

The recombinant adenoviruses are administered in a "an immunogenic 
amount", that is, an amount of recombinant adenovirus that is effective in a route of 
administration to transfectthe desired cells and provide sufficient levels of expression of 
the selected gene to induce an immune response. Where protective immunity is provided, 

25 the recombinant adenoviruses are considered to be vaccine compositions useful in 
preventing infection and/or recurrent disease. 

Alternatively, or in addition, the vectors of the invention may contain a 
transgene encoding a peptide, polypeptide or protein which induces an immune response 
to a selected immunogen. The recombinant adenoviruses of this invention are expected to 

30 be highly efficacious at inducing cytolytic T cells and antibodies to the inserted 
heterologous antigenic protein expressed by the vector. 
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For example, immunogens may be selected from a variety of viral 
families. Example of desirable viral families against which an immune response would 
be desirable include, the picornavirus family, which includes the genera rhinoviruses, 
which are responsible for about 50% of cases of the common cold; the genera 
5 enteroviruses, which include polioviruses, coxsackieviruses, echoviruses, and human 
enteroviruses such as hepatitis A virus; and the genera apthoviruses, which are 
responsible for foot and mouth diseases, primarily in non-human animals. Within the 
picornavirus family of viruses, target antigens include the VP 1, VP2, VP3, VP4, and 
VPG. Another viral family includes the calcivirus family, which encompasses the 

10 Norwalk group of viruses, which are an important causative agent of epidemic 

gastroenteritis. Still another viral family desirable for use in targeting antigens for 
inducing immune responses in humans and non-human animals is the togavirus family, 
which includes the genera alphavirus, which include Sindbis viruses, RossRiver virus, 
and Venezuelan, Eastern & Western Equine encephalitis, and rubivirus, including Rubella 

15 virus. The flaviviridae family includes dengue, yellow fever, Japanese encephalitis, St. 
Louis encephalitis and tick borne encephalitis viruses. Other target antigens may be 
generated from the Hepatitis C or the coronavirus family, which includes a number of 
non-human viruses such as infectious bronchitis virus (poultry), porcine transmissible 
gastroenteric virus (pig), porcine hemagglutinating encephalomyelitis virus (pig), feline 

20 infectious peritonitis virus (cats), feline enteric coronavirus (cat), canine coronavirus 
(dog), and human respiratory coronaviruses, which may cause the common cold and/or 
non-A, B or C hepatitis. Within the coronavirus family, target antigens include the El 
(also called M or matrix protein), E2 (also called S or Spike protein), E3 (also called HE 
or hemagglutin-elterose) glycoprotein (not present in all coronaviruses), or N 

25 (nucleocapsid). Still other antigens may be targeted against the rhabdovirus family, 

which includes the genera vesiculovirus (e.g., Vesicular Stomatitis Virus), and the general 
lyssavirus (e.g., rabies). Within the rhabdovirus family, suitable antigens may be derived 
from the G protein or the N protein. The family filoviridae, which includes hemorrhagic 
fever viruses such as Marburg and Ebola virus, may be a suitable source of antigens. The 

30 paramyxovirus family includes parainfluenza Virus Type 1 , parainfluenza Virus Type 3, 
bovine parainfluenza Virus Type 3, rubulavirus (mumps virus), parainfluenza Virus Type 
2, parainfluenza virus Type 4, Newcastle disease virus (chickens), rinderpest, 
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morbillivirus, which includes measles and canine distemper, and pneumovirus, which 
includes respiratory syncytial virus. The influenza virus is classified within the family 
orthomyxovirus and is a suitable source of antigen (e.g., the HA protein, the Nl protein). 
The bunyavirus family includes the genera bunyavirus (California encephalitis, La 
5 Crosse), phlebovirus (Rift Valley Fever), hantavirus (puremala is a hemahagin fever 
virus), nairovirus (Nairobi sheep disease) and various unassigned bungaviruses. The 
arenavirus family provides a source of antigens against LCM and Lassa fever virus. The 
reovirus family includes the genera reovirus, rotavirus (which causes acute gastroenteritis 
in children), orbiviruses, and cultivirus (Colorado Tick fever, Lebombo (humans), equine 

10 encephalosis, blue tongue). 

The retrovirus family includes the sub-family oncorivirinal which 
encompasses such human and veterinary diseases as feline leukemia virus, HTLVI and 
HTLVII, lentivirinal (which includes human immunodeficiency virus (HIV), simian 
immunodeficiency virus (SIV), feline immunodeficiency virus (FIV), equine infectious 

15 anemia virus, and spumavirinal). Among the lentiviruses, many suitable antigens have 
been described and can readily be selected. Examples of suitable HIV and SIV antigens 
include, without limitation the gag, pol, Vif, Vpx, VPR, Env, Tat, Nef, and Rev proteins, 
as well as various fragments thereof. For example, suitable fragments of the Env protein 
may include any of its subunits such as the gpl20, gpl60, gp41, or smaller fragments 

20 thereof, e.g., of at least about 8 amino acids in length. Similarly, fragments of the tat 

protein may be selected. [See, US Patent 5,891,994 and US Patent 6,193,981.] See, also, 
the HIV and SIV proteins described in D.H. Barouch et al, J. Virol., 75(5):2462-2467 
(March 2001), and R.R. Amara, et al, Science, 292:69-1 A (6 April 2001). In another 
example, the HIV and/or SIV immunogenic proteins or peptides may be used to form 

25 fusion proteins or other immunogenic molecules. See, e.g., the HIV-1 Tat and/or Nef 

fusion proteins and immunization regimens described in WO 01/54719, published August 
2, 2001, and WO 99/16884, published April 8, 1999. The invention is not limited to the 
HIV and/or SIV immunogenic proteins or peptides described herein. In addition, a 
variety of modifications to these proteins have been described or could readily be made 

30 by one of skill in the art. See, e.g., the modified gag protein that is described in US Patent 
5,972,596. Further, any desired HIV and/or SIV immunogens may be delivered alone or 
in combination. Such combinations may include expression from a single vector or from 
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multiple vectors. Optionally, another combination may involve delivery of one or more 
expressed immunogens with delivery of one or more of the immunogens in protein form. 
Such combinations are discussed in more detail below. 

The papovavirus family includes the sub-family polyomaviruses (BKU 
5 and JCU viruses) and the sub-family papillomavirus (associated with cancers or 

malignant progression of papilloma). The adenovirus family includes viruses (EX, AD7, 
ARD, O.B.) which cause respiratory disease and/or enteritis. The parvovirus family 
feline parvovirus (feline enteritis), feline panleucopeniavirus, canine parvovirus, and 
porcine parvovirus. The herpesvirus family includes the sub-family alphaherpesvirinae, 
10 which encompasses the genera simplexvirus (HSVI, HSVII), varicello virus 

(pseudorabies, varicella zoster) and the sub-family betaherpesvirinae, which includes the 
genera cytomegalovirus (HCMV, muromegalovirus) and the sub-family 
gammaherpesvirinae, which includes the genera lymphocryptovirus, EBV (Burkitts 
lymphoma), infectious rhinotracheitis, Marek's disease virus, and rhadinovirus. The 
15 poxvirus family includes the sub-family chordopoxvirinae, which encompasses the genera 
orthopoxvirus (Variola (Smallpox) and Vaccinia (Cowpox)), parapoxvirus, avipoxvirus, 
capripoxvirus, leporipoxvirus, suipoxvirus, and the sub-family entomopoxvirinae. 
The hepadnavirus family includes the Hepatitis B virus. One unclassified virus which 
may be suitable source of antigens is the Hepatitis delta virus. Still other viral sources 
20 may include avian infectious bursal disease virus and porcine respiratory and 

reproductive syndrome virus. The alphavirus family includes equine arteritis virus and 
various Encephalitis viruses. 

The present invention may also encompass immunogens which 
are useful to immunize a human or non-human animal against other pathogens including 
25 bacteria, fungi, parasitic microorganisms or multicellular parasites which infect human 
and non-human vertebrates, or from a cancer cell or tumor cell. Examples of bacterial 
pathogens include pathogenic gram-positive cocci include pneumococci; staphylococci; 
and streptococci. Pathogenic gram-negative cocci include meningococcus; gonococcus. 
Pathogenic enteric gram-negative bacilli include enterobacteriaceae; pseudomonas, 
acinetobacteria and eikenella; melioidosis; salmonella; shigella; haemophilus; moraxella; 
K ducreyi (which causes chancroid); brucella; Franisella tularensis (which causes 
tularemia); yersinia (pasteurella); streptobacillus moniliformis and spirillum; Gram- 
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positive bacilli include listeria monocytogenes; erysipelothrix rhusiopathiae; 
Corynebacterium diphtheria (diphtheria); cholera; B. anthracis (anthrax); donovanosis 
(granuloma inguinale); and bartonellosis. Diseases caused by pathogenic anaerobic 
bacteria include tetanus; botulism; other Clostridia; tuberculosis; leprosy; and other 
5 mycobacteria. Pathogenic spirochetal diseases include syphilis; treponematoses: yaws, 
pinta and endemic syphilis; and leptospirosis. Other infections caused by higher 
pathogen bacteria and pathogenic fungi include actinomycosis; nocardiosis; 
cryptococcosis, blastomycosis, histoplasmosis and coccidioidomycosis; candidiasis, 
aspergillosis, and mucormycosis; sporotrichosis; paracoccidiodomycosis, petriellidiosis, 

10 torulopsosis, mycetoma and chromomycosis; and dermatophytosis. Rickettsial infections 
include Typhus fever, Rocky Mountain spotted fever, Q fever, and Rickettsialpox. 
Examples of mycoplasma and chlamydial infections include: mycoplasma pneumoniae; 
lymphogranuloma venereum; psittacosis; and perinatal chlamydial infections. 
Pathogenic eukaryotes encompass pathogenic protozoans and helminths and infections 

15 produced thereby include: amebiasis; malaria; leishmaniasis; trypanosomiasis; 
toxoplasmosis; Pneumocystis carinii; Trichans; Toxoplasma gondii; babesiosis; 
giardiasis; trichinosis; filariasis; schistosomiasis; nematodes; trematodes or flukes; and 
cestode (tapeworm) infections. 

Many of these organisms and/or toxins produced thereby have 

20 been identified by the Centers for Disease Control [(CDC), Department of Heath and 
Human Services, USA], as agents which have potential for use in biological attacks. For 
example, some of these biological agents, include, Bacillus anthracis (anthrax), 
Clostridium botulinum and its toxin (botulism), Yersinia pestis (plague), variola major 
(smallpox), Francisella tularensis (tularemia), and viral hemorrhagic fevers [filoviruses 

25 (e.g., Ebola, Marburg], and arenaviruses [e.g., Lassa, Machupo]), all of which are 

currently classified as Category A agents; Coxiella burnetti (Q fever); Brucella species 
(brucellosis), Burkholderia mallei (glanders), BurkJwlderia pseudomallei (meloidosis), 
Ricinus communis and its toxin (ricin toxin), Closfridium perfringens and its toxin 
(epsilon toxin), Staphylococcus species and their toxins (enterotoxin B), Chlamydia 

30 psittaci (psittacosis), water safety threats (e.g., Vibrio cholerae, Crytosporidium parvum\ 
Typhus fever (Richettsia powazekii), and viral encephalitis (alphaviruses, e.g., 
Venezuelan equine encephalitis; eastern equine encephalitis; western equine 
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encephalitis); all of which are currently classified as Category B agents; and Nipan virus 
and hantaviruses, which are currently classified as Category C agents. In addition, other 
organisms, which are so classified or differently classified, may be identified and/or used 
for such a purpose in the future. It will be readily understood that the viral vectors and 
5 other constructs described herein are useful to deliver antigens from these organisms, 
viruses, their toxins or other by-products, which will prevent and/or treat infection or 
other adverse reactions with these biological agents. 

Administration of the vectors of the invention to deliver 
immunogens against the variable region of the T cells elicit an immune response 
10 including CTLs to eliminate those T cells. In RA, several specific variable regions of 
TCRs which are involved in the disease have been characterized. These TCRs include 
V-3, V-14, V-17 and Va-17. Thus, delivery of a nucleic acid sequence that encodes at 
least one of these polypeptides will elicit an immune response that will target T cells 
involved in RA. In MS, several specific variable regions of TCRs which are involved in 
15 the disease have been characterized. These TCRs include V-7 and Va-10. Thus, delivery 
of a nucleic acid sequence that encodes at least one of these polypeptides will elicit an 
immune response that will target T cells involved in MS. In scleroderma, several 
specific variable regions of TCRs which are involved in the disease have been 
characterized. These TCRs include V-6, V-8, V-14 and Va-16, Va-3C, Va-7, Va-14, 
20 Va-15, Va-1 6, Va-28 and Va-12. Thus, delivery of a recombinant simian adenovirus that 
encodes at least one of these polypeptides will elicit an immune response that will target 
T cells involved in scleroderma. 

C. Ad-Mediated Delivery Methods 

The therapeutic levels, or levels of immunity, of the selected gene can be 
monitored to determine the need, if any, for boosters. Following an assessment of CD8+ 
T cell response, or optionally, antibody titers, in the serum, optional booster 
immunizations may be desired. Optionally, the recombinant simian adenoviral vectors of 
the invention may be delivered in a single administration or in various combination 
regimens, e.g., in combination with a regimen or course of treatment involving other 
active ingredients or in a prime-boost regimen. A variety of such regimens have been 
described in the art and may be readily selected. 
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For example, prime-boost regimens may involve the administration of a 
DNA (e.g., plasmid) based vector to prime the immune system to second, booster, 
administration with a traditional antigen, such as a protein or a recombinant virus carrying 
the sequences encoding such an antigen. See, e.g., WO 00/1 1140, published March 2, 
5 2000, incorporated by reference. Alternatively, an immunization regimen may involve 
the administration of a recombinant simian adenoviral vector of the invention to boost the 
immune response to a vector (either viral or DNA-based) carrying an antigen, or a 
protein. In still another alternative, an immunization regimen involves administration of a 
protein followed by booster with a vector encoding the antigen. 
10 In one embodiment, the invention provides a method of priming and 

boosting an immune response to a selected antigen by delivering a plasmid DNA vector 
carrying said antigen, followed by boosting with a recombinant simian adenoviral vector 
of the invention. In one embodiment, the prime-boost regimen involves the expression 
of multiproteins from the prime and/or the boost vehicle. See, e.g., R.R. Amara, Science, 
15 292:69-74 (6 April 2001) which describes a multiprotein regimen for expression of 
protein subunits useful for generating an immune response against HIV and SIV. For 
example, a DNA prime may deliver the Gag, Pol, Vif, VPX and Vpr and Env, Tat, and 
Rev from a single transcript. Alternatively, the SIV Gag, Pol and HIV-1 Env is delivered 
in a recombinant adenovirus construct of the invention. Still other regimens are described 
20 in WO 99/16884 and WO 01/54719. 

However, the prime-boost regimens are not limited to immunization for 
HIV or to delivery of these antigens. For example, priming may involve delivering with a 
first chimp vector of the invention followed by boosting with a second chimp vector, or 
with a composition containing the antigen itself in protein form. In one example, the 
prime-boost regimen can provide a protective immune response to the virus, bacteria or 
other organism from which the antigen is derived. In another desired embodiment, the 
prime-boost regimen provides a therapeutic effect that can be measured using convention 
assays for detection of the presence of the condition for which therapy is being 
administered. 

The priming composition may be administered at various sites in the body 
in a dose dependent manner, which depends on the antigen to which the desired immune 
response is being targeted. The invention is not limited to the amount or situs of 
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injection(s) or to the pharmaceutical carrier. Rather, the regimen may involve a priming 
and/or boosting step, each of which may include a single dose or dosage that is 
administered hourly, daily, weekly or monthly, or yearly. As an example, the mammals 
may receive one or two doses containing between about 10 \ig to about 50 \ig of plasmid 
5 in carrier. A desirable amount of a DNA composition ranges between about 1 jag to about 
10,000 \xg of the DNA vector. Dosages may vary from about 1 jug to 1000 ^g DNA per 
kg of subject body weight. The amount or site of delivery is desirably selected based 
upon the identity and condition of the mammal. 

The dosage unit of the vector suitable for delivery of the antigen to the 
10 mammal is described herein. The vector is prepared for administration by being 

suspended or dissolved in a pharmaceutical^ or physiologically acceptable carrier such as 
isotonic saline; isotonic salts solution or other formulations that will be apparent to those 
skilled in such administration. The appropriate carrier will be evident to those skilled in 
the art and will depend in large part upon the route of administration. The compositions of 
15 the invention may be administered to a mammal according to the routes described above, 
in a sustained release formulation using a biodegradable biocompatible polymer, or by 
on-site delivery using micelles, gels and liposomes. Optionally, the priming step of this 
invention also includes administering with the priming composition, a suitable amount of 
an adjuvant, such as are defined herein. 
2 0 Preferably, a boosting composition is administered about 2 to about 27 

weeks after administering the priming composition to the mammalian subject. The 
administration of the boosting composition is accomplished using an effective amount of 
a boosting composition containing or capable of delivering the same antigen as 
administered by the priming DNA vaccine. The boosting composition may be composed 
25 of a recombinant viral vector derived from the same viral source (e.g., adenoviral 
sequences of the invention) or from another source. Alternatively, the "boosting 
composition" can be a composition containing the same antigen as encoded in the 
priming DNA vaccine, but in the form of a protein or peptide, which composition induces 
an immune response in the host. In another embodiment, the boosting composition 
30 contains a DNA sequence encoding the antigen under the control of a regulatory sequence 
directing its expression in a mammalian cell, e.g., vectors such as well-known bacterial or 
viral vectors. The primary requirements of the boosting composition are that the antigen 
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of the composition is the same antigen, or a cross-reactive antigen, as that encoded by the 
priming composition. 

In another embodiment, the simian adenoviral vectors of the invention are 
also well suited for use in a variety of other immunization and therapeutic regimens. 
5 Such regimens may involve delivery of simian adenoviral vectors of the invention 

simultaneously or sequentially with Ad vectors of different serotype capsids, regimens in 
which adenoviral vectors of the invention are delivered simultaneously or sequentially 
with non-Ad vectors, regimens in which the adenoviral vectors of the invention are 
delivered simultaneously or sequentially with proteins, peptides, and/or other biologically 
10 useful therapeutic or immunogenic compounds. Such uses will be readily apparent to one 
of skill in the art. 

The following examples illustrate the cloning of the simian adenoviruses 
and the construction of exemplary recombinant adenovirus vectors of the present 
invention. These examples are illustrative only, and do not limit the scope of the present 
15 invention. 

Example 1 - Viral Propagation 

The Pan5 [ATCC Accession No. VR-591], Pan6 [ATCC Accession No. 
VR-592], and Pan7 [ATCC Accession No. VR-593] viruses, originally isolated from 

20 lymph nodes from chimpanzees, were propagated in 293 cells [ATCC CRL1573]. 
Typically, these cells are cultured in Dulbecco's Modified Eagles Medium (DMEM; 
Sigma, St. Louis, MO.) supplemented with 10% fetal calf serum (FCS) [Sigma or 
Hyclone, Logan, UT] and 1 % Penicillin-Streptomycin (Sigma). Infection of 293 cells is 
carried out in DMEM supplemented with 2% FCS for the first 24 hours, after which FCS 

25 is added to bring the final concentration to 1 0%. Infected cells are harvested when 100% 
of the cells exhibit virus-induced cytopathic effect (CPE), and are then collected, and 
concentrated by centrifugation. Cell pellets are resuspended in 10 mM Tris (pH 8.0), and 
lysed by 3 cycles of freezing and thawing. Virus preparations are obtained following two 
ultra centrifugation steps on cesium chloride density gradients and stocks of virus are 

30 diluted to 1 to 5 x 1 0 12 particles/ml in 1 0 mM Tris/1 00 mM NaCl/50% glycerol and stored 
at-70°C. 
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The ability of 293 cells to propagate these adenoviruses exceeded expectations 
which were based on knowledge of other non-human adenovirus serotypes. 



Virus Yield (virus particles produced in 8xl0 8 cells^l 
Pan5 8.8 xlO 13 

5 Pan6 1.6 xlO 14 

Pan7 8.8 xlO 13 

Example 2 - Characterization of Viral Genomic DNA 
10 Genomic DNA was isolated from the purified virus preparations of 



Example 1 and digested with Hindlll or BamHI restriction enzymes following the 
manufacturers' recommendations. The results (not shown) revealed that that the Pan5, 
Pan6, Pan7 genomes of the invention and the published Pan 9 (C68) genome show 
different restriction patterns, and thus, are distinct from each other. 

!5 The nucleotide sequences of Pan5, Pan6 and Pan7 were determined. The 

nucleotide sequence of the top strand of Pan5 DNA is reported in SEQ ID NO: 1. The 
nucleotide sequence of the top strand of Pan6 DNA is reported in SEQ ID NO: 5. The 
nucleotide sequence of the top strand of Pan 7 DNA is reported in SEQ ID NO: 9. 

Regulatory and coding regions in the viral DNA sequences were 

20 identified by homology to known adenoviral sequences using the "Clustal W" program 
described above at conventional settings. See the tables above providing the adenoviral 
sequences. Open reading frames were translated and the predicted amino acid sequences 
examined for homology to previously described adenoviral protein sequences, Ad4, Ad5, 
Ad7, Adl2,and Ad40. 

25 Analysis of the sequence revealed a genome organization that is similar to 

that present in human adenoviruses, with the greatest similarity to human Ad4. However, 
substantial differences in the hexon hypervariable regions were noted between the 
chimpanzee adenoviruses and other known adenoviruses, including AdHu4. These 
differences fit well with the serological cross-reactivity data that has been obtained (see 

30 below). 

An alignment of a portion of the hexon sequences is shown in Fig. 1. The 
portion shown is from the region of the hexon that corresponds to the outwardly disposed 
extended loops DEI and FG1 where the most variability between serotypes is observed. 
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An intervening portion that contributes to the base of the hexon (corresponding to 
residues 308-368 of the published AdC68 sequence; US Patent 6,083,716), and is highly 
conserved between serotypes, is also present. The following table summarizes the pair- 
wise comparisons of the amino acids in the hexon proteins. 



Comparison 


Hexon amino-acid 
Similarity (%) 


#1 


§2 




AdC5 


AdC7 


99.0 


AdC5 


AdC68 


98.3 


AdC5 


AdC6 


88.0 


AdC5 


AdCI 


84.9 


AdC6 


AdC7 


87.7 


AdC6 


AdC68 


87.3 


AdC6 


AdCI 


84.9 


AdC7 


AdC68 


97.5 


AdC7 


AdCI 


84.8 


AdC68 


AdCI 


84.9 



Analysis of the fiber knob domain (which is responsible for receptor 
binding) of the chimpanzee adenoviruses shows an overall similarity in structure (Fig. 2). 
10 The degree of sequence similarity between the El proteins of huAd5 and 

C68 (see Tables below) is similar to that between huAdS and Pan-5, Pan-6, and Pan-7. 



55 



WO 03/046124 



PCT/US02/33645 



Comparison 


Ela (13S) amino-acid 
identity (%) 


■U 1 

#1 


#2 




AdHu5 


AdC5 


36.6 


AdHu5 


AdCo 


28.5 


A JT T.. C 

AdHu5 


AdC7 


34.9 


A jt r. . c 

AdHuS 


AdC68 


35.6 


A JT T..C 

AdHu5 


Add 


35.6 


AdC5 


AdCo 


68.3 


AdC5 


AdC7 


96.9 


AdC5 


AdCoo 


80.4 






51.3 


AdC6 


AdC7 


69.3 


AdC6 


AdC68 




AdC6 


AdCl 


37.7 


AdC7 


AdC68 


81.5 


AdC7 


AdCl 


51.0 


AdC68 


AdCl 


54.9 





Sequence Identity with human 
Ad5 




Elb Small T 
Protein 


Elb Large T 
Protein 


C68 


47.3% 


55.8% 


Pan-5 


43.2% 


54.5% 


Pan-6 


45.3% 


54.5% 


Pan-7 


46.4% 


53.8% 
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Replication-defective versions of AdC5, AdC6 and AdC7 were created by 
molecular cloning methods described in the following examples in which minigene 
cassettes were inserted into the place of the Ela and Elb genes. The molecular clones of 
the recombinant viruses were rescued and grown up in 293 cells for large-scale 
5 purification using the published CsCl sedimentation method [K. Fisher et al 9 J. ViroL, 
70:520 (1996)]. Vector yields were based on 50 plate (150 mm) preps in which 
approximately 1 x 10 9 293 cells were infected with the corresponding viruses. Yields 
were determined by measuring viral particle concentrations spectrophotometrically. After 
having constructed El-deleted vectors, it was determined that HEK 293 cells (which 

10 express human adenovirus serotype 5 El functions) trans-complement the El deletions of 
the novel viral vectors and allow for the production of high titer stocks. Examples of 
virus yields for a few of these recombinant viruses are shown in the table below. 

The transgenes for these vectors, P-galactosidase (LacZ), green 
fluorescent protein (GFP), alpha-l-anti-trypsin (Al AT), ebola glycoprotein (ebo), a 

15 soluble ebola glycoprotein variant lacking the transmembrane and cytoplasmic domains 
(sEbo), and three deletion mutants of the ebola glycoprotein (EboA2, EboA3, and 
EboA4), were expressed by the cytomegalovirus promoter (CMV). In the following table, 
ND indicates that the study has not yet been done. 



Transgene 


Viral backbone/Vector yield (Viral particles x 10 13 ) 




AdHu5 




AdC7 




AdC68 




AdC6 


CMVLacZ 


1.5 




1.4 




3.3 




6.1 


CMVGFP 


2.5 




3.6 




8 




10 


CMVA1AT 


3.7 




6 




10 




ND 


CMVEbo 


1.1 




4.3 




ND 




ND 


CMVsEbo 


4.9 




5.4 




ND 




ND 


CMVEboA2 


1 




9.3 




ND 




ND 


CMVEboA3 


0.8 




9.5 




ND 




ND 


CMVEboA4 


1.4 




6.2 




ND 




ND 
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The ability of human adenovirus El to trans-complement the El -deleted 
chimpanzee viruses of the invention is highly advantageous, as it permits the production 
of El -deleted chimpanzee adenoviral vectors of the invention, while reducing or 
eliminating the risk of homologous recombination due to the differences in sequences 
5 between human Ad and the chimpanzee adenoviruses described herein. 

Example 3 - Serological Studies of Pan 5, 6, and 7 Viruses 

Because of the differences in the hexon hypervariable region, it was anticipated 
that the C5, C6, and C7 viruses would be serologically distinct from human adenoviruses, 
10 including AdHu4. 

1 . Cross-Reactivity of Wild-type Viruses 

For screening of wild-type viruses in order to make a determination of 
antibody cross-reactivity, the replication competent viruses were used and inhibition of 
cytopathic effects (CPE) was measured. Briefly, preparations of adenoviruses (AdhuS, 
15 Pan-5, Pan-6, Pan-7 and AdC68) stored at 5xl0 12 particles/ml were diluted 1/600 for the 
assays. This concentration of virus was selected since it results in 1 00% CPE within 48 
hours in the absence of neutralization. Prior to adding the virus to 293 cells (4x1 0 4 
cells/well in a 96 well dish), 1:20 dilutions of sera were added. The assay is read as the 
presence or absence of CPE; flill neutralization would read as no cytopathic effect. The 
20 results are summarized in the Table below. The fact that 9/36 human sera neutralized 

AdhuS induced CPE is consistent with previous estimates of neutralizing antibodies in the 
human population. The numbers indicate the total individuals who showed neutralization 
(numerator) versus the total number screened (denominator). ND = not determined. 

Neutralization by 1/20 diln of serum 





Human 


Rhesus 


Chimpanzee 




(N=36) 


(N=52) 


(N=20) 


Adhu5 


9/36 


ND 


ND 


AdC68 


1/36 


0/52 


12/20 


Pan 5 


0/36' 


0/52 


10/20 


Pan 6 


0/36 


0/52 


9/20 


Pan 7 


0/36 


0/52 


12/20 
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Of all human sera screened, 35/36 were negative for neutralization to AdC68 
while 36/36 were negative for neutralization to Pan-5, Pan-6 and Pan-7. Of 52 rhesus 
monkeys screened, none showed neutralization to any chimpanzee adenovirus; rhesus 
monkey is the preferred pre-clinical model for evaluating HIV vaccines. Between 9 to 12 
5 out of 20 chimpanzees had substantial neutralization to one or another of the chimpanzee 
adenoviruses consistent with the fact these are indeed endemic chimpanzee-specific 
pathogens. Interestingly, there are chimpanzees with neutralizing antibodies only to Pan- 
5, Pan-6 or AdC68 supporting the hypothesis that several of these chimpanzee adenoviral 
vectors will not cross neutralize each other and are distinct serotypes. 

10 The same assay was carried out for 20 chimpanzee serum samples. Fifty percent 

(50%) of the samples reacted serologically, in different degrees to Pan5; 40% to Pan6; 
55% to Pan7 and 60% to C68. Among the positive serum samples, one of them had 
strong neutralizing activity to all four chimp viruses. 

2. Cross-neiifralization with Recombinant Viruses 

15 High-titer polyclonal antibodies were obtained to each of the simian 

adenoviruses in order to more precisely gauge the degree of cross-neutralization among 
the different serotypes. This was done by intramuscular immunization of rabbits using a 
recombinant virus containing GFP based on previously the described C68 chimpanzee 
adenovirus as an adjuvant. The serum was then used to assay for neutralizing activity 

20 against each of the three chimpanzee adenoviruses of the invention, AdC5, AdC6 and 
AdC7. A rabbit was injected with 5xl0 12 viral particle per kg of C68CMVGFP vector 
intramuscularly and boosted 5 weeks later using the same dose. A bleed collected at the 9 
week time point revealed extremely potent neutralizing activity against C68 as well as 
Pan-5 and Pan-7 but not against Pan-6 (see Table below), indicating that the 

25 administration of a C68 (or Pan-5 and Pan-7) based vaccine could be effectively followed 
by a boost using a vector based on Pan-6. However, it has been found that this level of 
inter-relatedness does not necessarily prevent with re-administration in a setting where 
antiviral antibody titers were not as high as was achieved in this rabbit. In the following 
table, + indicates 33% CPE; ++ indicates 66% CPE; +-H- indicates 100% CPE. 
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Infection on 293 cells with virus: 




Pan5 


Pan6 


Pan7 


Pan9(C68) 


C68 
GFP 


Serum 
Dilution 


- 


+++ 


- 


- 


- 


1/20 


- 


+++ 


- 


- 


- 


1/40 


- 


+++ 


- 


- 


- 


1/80 


- 


+++ 




- 


- 


1/160 


- 


+++ 




- 


- 


1/320 


- 


+++ 




- 


- 


1/640 


- 


+4+ 


- 


- 


- 


1/1,280 


- 


+4+ 




- 


- 


1/2,560 




4-44 




- 


- 


1/5,120 


4 


+44 




- 


- 


1/10,240 


+ 


4-44 


++ 


- 


- 


1/20,480 i 


++ 


+4+ 


+++ 


- 


- 


1/40,960 


++ 


4-44 


+++ 


4 


4 


1/81,920 


444 


+44 


+++ 


+4 


4+ 


1/163,840 


444 


+++ 


+++ 


44+ 


i ii 


1/327,680 


+4+ ! 


+++ 


+++ 


444 


+++ 


1/665,360 


4++ 


+++ 


4++ 


444 


+++ 


1/1,310,720 


444 


+++ 


4++ 


44+ 


+++ 


1/2,621,440 
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3 . Quantitative Assay for Detection of Neutralizing Antibody 

The result was validated by a more quantitative-based assay for detecting 
neutralizing antibody, which is based on transduction of a GFP vector. Briefly, groups of 
C57BL/6 mice were immunized intramuscularly or intravenously with 5.0 x 10 10 
5 particles/ml Pan5, Pan6, Pan7 or C68. Sera from day 28 bleeds were tested for cross- 
neutralizing activity against C68CMVEGFP at dilutions of 1/20 and 1/80. In summary, 
when a pharmaceutical preparation of human immunoglobulin was tested for serological 
reactions to Pan 5, 6, and 7, and C68, some low levels of neutralizing activities against 
Pan 7 and C68 were detected. No neutralizing activity against Pan5 or Pan6 was 
10 detected. Serum samples from 36 human subjects were run for the same assay. Serum 
samples were tested at a 1/20 dilution. The results indicated that only one individual has 
clear neutralizing activity to C68. No neutralizing activity to Pan5, Pan6 or Pan7 was 
detected. 

4. In Vitro Cross-Neutralization 
Cross-neutralization of the simian adenoviruses by high-titer rabbit 

polyclonal antibodies raised against each of the adenoviruses Pan-5, Pan-6, Pan-7, and 
C68 was tested. 

Rabbits were immunized with intra-muscular injections of 10 13 particles 
of each of the chimpanzee adenoviruses and boosted 40 days later with the same dose 
with incomplete Freund's adjuvant. Sera were analyzed or the presence of neutralizing 
antibodies by incubating serial two-fold dilutions with 10 9 genome copies of each of the 
appropriate chimpanzee adenovirus vector expressing GFP and testing for the attenuation 
of GFP expression when applied to 293 cells. The serum dilution which produced a 50% 
reduction of GFP expression was scored as the neutralizing antibody titer against that 
particular virus. 

The results are shown in the Table. The data are consistent with the 
expectation from sequence analysis of the hexon amino-acid sequences, which indicated 
that Ad Pan-6 was likely to be the most serologically distinct compared to the other 
chimpanzee adenoviruses. 
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Infection of 293 cells with 10 9 genome copies of 


Serum from 
rabbit 
immunized 
with: 


Ad Pan-5 


Ad Pan-6 


Ad Pan-7 


Ad C68 


Ad Pan-5 


1/5120 


<l/20 


1/2560 


1/2560 


A A Pan & 


No 

neutralization 


1/20,480 


<l/20 


<l/20 


Ad Pan-7 


1/2560 


1/160 


1/163,840 


1/2560 


Ad C68 


No 

neutralization 


<l/20 


<l/20 


1/5120 



In order to determine whether antibodies cross-reacting with the simian 
adenoviruses were likely to be of low prevalence in humans, simian adenoviruses SV1, 
SV39, and SV25 were tested for their ability to withstand neutralization when incubated 
5 with commercially available pooled human immunoglobulins (Ig). The same assay was 
also performed with AdhuS and the chimpanzee adenoviruses Pan-5, Pan-6, Pan-7, and 
C68. In a further study, sera from mice has been immunized with one of the chimpanzee 
adenoviruses C5, C6, C7, and C68 and their ability to cross-neutralize the simian 
adenoviruses SV-15, SV-23, SA-17, and Baboon Adenovirus has been tested. No cross- 
10 reactivity was observed in any case. 

Example 4 - Generation of Recombinant El -Deleted Pan5 Vector 

A modified pX plasmid was prepared by destroying the Fspl site in the bla 
gene region of pX (Clontech) by site-directed mutagenesis. The resulting modified 
15 plasmid, termed pX', is a circular plasmid of 3000 bp which contains an fl ori and an 
ampicillin resistance gene (AmpR-cds). 

A. Production of Pan-5 Adenovirus Plasmid 

A polylinker for sequential cloning of the Pan5 DNA fragments 
into pX' is created. The polylinker is substituted for the existing pX' polylinker 
20 following digestion with Mini and EcoRI. The blunt-Fsel fragment of the Pan 5 is 

inserted into the Smal and Fsel sites of the polylinker. This fragment contains the 5' end 
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of the adenoviral genome (bp 1 to 3606, SEQ ID NO:l). The SnaBI-FspI fragment of 
Pan 5 (bp 455 to 3484, SEQ DDNO:l) is replaced with a short sequence flanked by I-Ceu 
and Pl-Sce sites from pShuttle (Clontech), to eliminate the El region of the adenoviral 
genome. The EcoRI-blunt fragment of Pan5 (bp 28658 to 36462, SEQ ID NO:l) is 
5 inserted into the EcoRI and EcoR V sites of the polylinker (to provide the 3 ' end of the 
adenoviral genome); the Fsel-Mlul fragment (bp 3606 to 15135, SEQ ID NO:l) is 
inserted into the polylinker; and the MluI-EcoRI fragment is inserted into the polylinker 
(bp 15135 to 28658, SEQ ID NO:l). Optionally, a desired transgene is inserted into I- 
Ceul and Pl-Scel sites of the newly created pX'Pan5AEl vector. 

B. Alternative Method of Generating pX'Pan5AEl . 
The initial plasmid pX is derived from pAdX adenovirus 

plasmid available from Clontech, as described above. Thereafter, a Pacl-Xhol region 
of pX' was deleted and the blunt-ended Pan5 polylinker was inserted into the Fspl site 
to generate pX'PLNK (2994 bp). The 5 'end-Fsel region of Pan 5 (bp 1-3607, SEQ 
ID NO:l) was inserted into Smaland Fsel sites of pX'LNK to generate the pX'Pan5- 
5' plasmid (6591 bp). The SnaBi-Ndel region of pX'Pan5-5' was excised and 
replaced with the Ceu/Sce cassette, which had been PCR amplified from pRCS to 
create pX'Pan5-5'AEl (4374 bp). Briefly, a sequence containing I-Ceul and Pl-Scel 
rare cutter sites was PCR amplified from pRCS (31 13bp). The 3' PCR primer was 
introduced an Ndel site into the PCR product. 

To extend the Pan5 DNA in pX'Pan5-5'AEl (4374 bp), the 
Fsel-Mlul region of Pan 5 (bp 3607-15135, SEQ ID NO:l) is added, to create 
pX'Pan5-5'Mlu (15900 bp). The remaining MluI-3' end of the Pan5 sequence (bp 
15135-36462, SEQ ID NO:l) is added to the vector between the Mlul and EcoRV 
sites of the vector polylinker to form pX'Pan5AEl which contains the full-length 
Pan5 sequence containing a deletion in the El region. 

C. Generation of Recombinant Viruses 
To generate the recombinant adenoviruses from pX'Pan5AEl, 

the plasmid is co-transfected with a helper expressing El, or from an El -expressing 
packaging cell line, such as 293 cell line or a cell line prepared as described herein. 
The expression of El in the packaging cell permits the replication and packaging of 
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the Pan5 AE1 into a virion capsid. In another embodiment, the packaging cell 
transfected with pXTan5AEl is transfected with an adenovirus vector as described 
above bearing the transgene of interest. Homologous recombination occurs between 
the helper and the plasmid, which permits the adenovirus-transgene sequences in the 
5 vector to be replicated and packaged into virion capsids, resulting in the recombinant 
adenovirus. 

Transfection is followed by an agar overlay for 2 weeks, after 
which the viruses are plaqued, expanded and screened for expression of the transgene. 
Several additional rounds of plaque purification are followed by another expansion of 
10 the cultures. Finally the cells are harvested, a virus extract prepared and the 

recombinant chimpanzee adenovirus containing the desired transgene is purified by 
buoyant density ultracentrifiigation in a CsCl gradient or by alternative means known 
to those of skill in the art. 



1 5 Example 5 - Generation of Recombinant El -Deleted Pan6 Vector 

A. Strategy for Construction of Pan-6 Adenoviral Plasmid 
L Cloning of terminal fragments 

Pan 6 virus is deproteinated by pronase and proteanase K 
treatment and phenol extraction. Synthetic 1 2 bp Pme I linkers are ligated onto the viral 
20 DNA as described by Berkner and Sharp, Nucleic Acids Research, 11: 6003 (1983). The 
viral DNA is then digested with Xba I to isolate a 5' terminal fragment (6043 bp). The 
Ad6 Xbal 5' fragment is then ligated into pX link at Sma I and Xba I sites to form pX- 
AdPan6-0-16.5. The viral DNA with Pme I linkers is also digested with Pac I to isolate 
the 6475 bp 3' terminal fragment and cloned into pX link at Pac I and Sma I sites, 
2 5 resulting in pXAdPan6-82- 1 00. 

2. Deletion of El from the 5 3 clone 

To delete El (m.u. 1 .2-9), the BsiWi-Xba I fragment in pX- 
AdPan6-0-16.5 is replaced with a PCR fragment spanning m.u.9-16.7 fragment treated 
with BsiWi and Xba I, leading to pX-Ad-Pan6 m.u.0-1, 9-16.5 . 
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3. Fusion of 5 1 and 3 ' clones and to create an anchor site to 
accept the middle Hind III fragment 

First, the 5 5 clone, pX-Ad-Pan6 m.u.0-1, 9-16.5, is further 
5 expanded by inserting the 2 nd Xba I fragment (4350 bp, m.u.16.5 - 28) from Pan 6 

genome into the Xba I site in the pX-Ad-Pan6 m.u.0-1, 9-16.5. This construct is named 
pXAd-Pan6-mu 0-1, 9-28. 

Second, the V clone is also expanded by inserting the 15026 bp 
Mlu I/Pac I fragment covering m.u.41-82 from Pan 6 genome into the Mlu I/Pac I sites of 
10 pXAdPan6-82-100, generating pXAdPan6-m.u.41-100. 

Then, a 8167 bp Hind in/Eco 47III Pan 6 fragment is isolated 
from pXAd-Pan6-mu 0-1, 9-28 and subcloned into pXAdPan6-m.u.41-100 at Hind III and 
Xba I blunt sites. This 5' and 3' fusion clone is called pXAdPan6rnu0-l, 9-19.5, 64-100. 

4. Drop of the middle fragment of the genome into the fusion 

15 clone 

A 16335 bp Hind III fragment (m.u.19.5 - 64) from Pan 6 is 
inserted into Hind III site of pXAdPan6muO-l, 9-19.5, 64-100 to form pXAdPan6-0-l, 9- 
100. 

5. Introduction of a PKGFP selective marker into the final 
20 construct for direct cloning the gene of interest and green/white selection of 

recombinant transformants. 

A minigene cassette that expresses GFP under a lac promoter and 
is flanked with recognition sites of rare intron encoding restriction enzymes, Pl-Sce I and 
I-Ceu I, was isolated from pShuttle-pkGFP (bare) by Sap I and Dra III digestions 

25 followed by filling-in reaction. The pShuttle-pkGFP (bare) plasmid is 4126 bp in length, 
and contains a ColEl-Ori, a kanamycin resistance gene, plac, a LacZ promoter-GFPmut3- 
1 cds (Clontech), and a GFPmut3-l cds (Clontech). This cassette is subcloned into Srf I 
cut and blunted pXAdPan6-0-l, 9-100. This final construct is called pX-Pan6-pkGFP 
mu.0-1, 9-100, which is useful for generating recombinant El-deleted Pan 6 molecular 

30 clones carrying genes of interest by direct ligation and green/white selection in 
combination with the generic pShuttlepkGFP vectors. 
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B. Alternative Strategy for Generation of Pan-6 Plasmid 

1. Cloning of 5 ' terminal fragment 

The Pan 6 virus is deproteinated by pronase and 
proteanase K treatment and phenol extraction as described above and synthetic 12 bp 
5 Pme I linkers are ligated onto the viral DNA as described. The AdPan6 5' Xbal fragment 
is isolated and ligated into pX to form pX-AdPan6-0-16.5 (9022 bp) as described in Part 
A above. 

2. Deletion of El from the 5 1 clone 

To delete El (m.u. 1.2-9), pX-AdPan6-0-16.5 is digested 
10 with SnaBI and Ndel to remove the regions encoding the El a and Elb proteins (3442- 
6310 bp). This vector is subsequently digested with BsiWI in preparation for blunting 
with the minigene cassette carrying a selective marker. 

3. Introduction of a selective marker 

A minigene cassette that expressed GFP under a lac 
15 promoter and which is flanked with recognition sites of rare intron encoding restriction 
enzymes, Pl-Xcel and I-Ceul, was isolated from pShuttle-pkGFP as described above. 
The Dralll-Sapl fragment is then ligated with the digested pX-AdPan6-0-16.5 to form 
pX-AdPan6 MU 0-16.5AE1 (7749 bp). 

4. Extension of Pan-6 Adenoviral Sequences 

20 pX-AdPan6 MU 0-16.5AE1 was subjected to Xbal 

digestion to permit insertion of an Xbal-RsrII linker. An Xbal/Rsrll digestion fragment 
from the AdPan6 genome was isolated (mu 28-100, 26240 bp) and ligated into the 
Xba/RsrII-digested pX-AdPan6 MU 0-16.5AE1 to provide pX-AdPan6 MU 0-1, 9-16.5, 
28-100. A second Xbal fragment from the Pan6 genome (mu 1 6.5-28, 4350 bp) is then 

25 ligated into this plasmid to form pX-AdPan6 MU 0-1, 9-100 (3855 1 bp). 

C. Generation of Recombinant Adenoviruses 

To generate the recombinant adenoviruses from a El -deleted Pan6 
plasmid prepared as described in Parts A or b, the plasmid is co-transfected with a helper 
expressing El, or from an El-expressing packaging cell line, such as 293 cell line or a 

30 cell line prepared as described herein. The expression of El in the packaging cell permits 
the replication and packaging of the Pan6-pkGFP mu.0-1, 9-100 into a virion capsid. 
Alternatively, the packaging cell transfected with pX-Pan6-pkGFP mu.0-1, 9-100 is 
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transfected with an adenovirus vector as described above bearing another transgene of 
interest. 

Example 6 - Generation of Recombinant El -Deleted Pan7 Vector 
5 A. Generation of Pan 7 Plasmids 

A synthetic linker containing the restriction sites Pacl-Smal-Fsel- 
MluI-EcoRV-PacI was cloned into pBR322 that was cut with EcoRI and NdeL The left 
end (bpl to 3618) of Ad Pan7 was cloned into the linker between the Smal and Fsel sites. 
The adenovirus El was then excised from the cloned left end by cutting with SnaBI and 

10 Ndel and inserting an I-CeuI-GFP-PI-Scel cassette from pShuttle (Clontech) in its place. 
The resulting plasmid was cut with Fsel and Mlul and Ad Pan7 fragment Fsel (bp 361 8) 
to Mlul (bp 155114 was inserted to extend the left end. The construct (pPan7pGFP) was 
completed by inserting the 21421 bp Ad Pan7 right end fragment from the Mlul site (bp 
151 14) into the above plasmid between Mlul and EcoRV to generate a complete 

15 molecular clone of El deleted adenovirus Pan7 suitable for the generation of recombinant 
adenoviruses. Optionally, a desired transgene is inserted into the I-Ceul and Pl-Scel sites 
of the newly created pPan7 vector plasmid. 

B. Construction of El -Deleted Pan 7 Viral Vectors 

To generate the recombinant adenoviruses from pPan7AEl, the 

20 plasmid is co-transfected with a helper expressing El, or from an El -expressing 

packaging cell line, such as 293 cell line or a cell line prepared as described herein. The 
expression of El in the packaging cell permits the replication and packaging of the 
Pan7AEl into a virion capsid. In another embodiment, the packaging cell transfected 
with pX'Pan7 AE1 is transfected with an adenovirus vector as described above bearing 

25 the transgene of interest. Homologous recombination occurs between the helper and the 
plasmid, which permits the adenovirus-transgene sequences in the vector to be replicated 
and packaged into virion capsids, resulting in the recombinant adenovirus. Transfection 
and purification is as described above. 

30 Example 7 - Generation of Plasmid Vectors Expressing the El Genes 

Plasmid vectors are constructed which encode the Pan5 El region gene, and these 
plasmids are used to generate stable cell lines expressing viral El proteins. 
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The El region of Pan5 is cloned into pX\ essentially. as described in Example 4 
above, prior to replacement of this region with the fragment from pShuttle (Clontech). 
The expression plasmid contains the Pan5 adenoviral genome sequence spanning at least 
bp 1 to 3959 in the Pan5 genomic sequence. Thus, the expression plasmid contains the 
5 sequence encoding El a and Elb of chimpanzee Ad Pan5 under the control of a 

heterologous promoter. Similar expression plasmids can be generated using the Ad Pan6 
and AdPan 7 El regions, identified in the tables above. 

Example 8 - Generation of Cell Lines Expressing Chimpanzee Adenovirus El Proteins 

10 Cell lines expressing viral El proteins are generated by transfecting HeLa (ATCC 

Acc. No. CCL2) with the plasmid of Example 6. These cell lines are useful for the 
production of El -deleted recombinant chimpanzee adenoviruses by co-transfection of 
genomic viral DNA and the expression plasmids described above. Transfection of these 
cell lines, as well as isolation and purification of recombinant chimpanzee adenoviruses 

15 therefrom are performed by methods conventional for other adenoviruses, i.e., human 
adenoviruses [see, e.g., Horwitz, cited above and other standard texts]. 
A. Cell lines expressing Pan5 El proteins 

HeLa cells in 10cm dishes are transfected with 10 jxg of pX-Pan51-El 
DNA using a Cellphect™ kit (Pharmacia, Uppsala, Sweden) and following the 

20 manufacturer's protocol. 22 hours post-transfection, the cells are subjected to a three 
minute glycerol shock (15% glycerol in Hepes Buffered Saline, pH 7.5) washed once in 
DMEM (HeLa) or F12K (A549; Life Technologies, Inc., Grand Island, NY) media 
supplemented with 10% FCS, 1% Pen-Strep, then incubated for six hours at 37°C in the 
above described media. The transfected cells are then split into duplicate 15cm plates at 

25 ratios of 1:20, 1:40, 1:80, 1:160, and 1:320. Following incubation at 37°C overnight, the 
media is supplemented with G41 8 (Life Technologies, Inc.) at a concentration of 1 fig/ml. 
The media is replaced every 5 days and clones are isolated 20 days post-transfection. 

HeLa El cell clones are isolated and assayed for their ability to augment 
adeno-associated virus (AAV) infection and expression of recombinant LacZ protein as 

30 described below. 
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B. AAV Augmentation Assay for Screening El Expressing Cell 

Lines 

AAV requires adenovirus-encoded proteins in order to complete 
its life cycle. The adenoviral El proteins as well as the E4 region-encoded ORF6 protein 
5 are necessary for the augmentation of AAV infection. An assay for El expression based 
on AAV augmentation is used. Briefly, the method for identifying adenoviral El- 
expressing cells comprises the steps of infecting in separate cultures a putative adenovirus 
El -expressing cell and a cell containing no adenovirus sequence, with both an adeno- 
associated virus (AAV) expressing a marker gene and an AAV expressing the ORF6 of 
10 the E4 gene of human adenovirus, for a suitable time. The marker gene activity in the 
resulting cells is measured and those cells with significantly greater measurable marker 
activity than the control cells are selected as confirmed El-expressing cells. In the 
following experiment, the marker gene is a lacZ gene and the marker activity is the 
appearance of blue stain. 

For example, the cell lines described above, as well as 
untransfected control cells (HeLa) are infected with 100 genomes per cell of an AAV 
vector bearing a marker gene, e.g., AV.LacZ [K. Fisher et ai, J. Virol., 70:520 (1996)] 
and an AAV vector expressing the ORF6 region of human 5 (AV.orf6). The DNA 
sequence of the plasmid generates a novel recombinant adeno-associated virus (rAAV) 
containing the LacZ transgene and the Ad E4 ORF 6, which is an open reading frame 
whose expression product facilitates single-stranded (ss) to double-stranded (ds) 
conversion of rAAV genomic DNA. These vectors are incubated in medium containing 
2% FCS and 1% Pen-Strep at 37°C for 4 hours, at which point an equal volume of 
medium containing 10% FCS is added. It should be understood by one of skill in the art 
that any marker gene (or reporter gene) may be employed in the first AAV vector of this 
assay, e.g., alkaline phosphatase, luciferase, and others. An antibody-enzyme assay can 
also be used to quantitate levels of antigen, where the marker expresses an antigen. The 
assay is not limited by the identity of the marker gene. Twenty to twenty-four hours 
post-infection, the cells are stained for LacZ activity using standard methods. After 4 
hours the cells are observed microscopically and cell lines with significantly more blue 
cells than the A549 or HeLa cell controls are scored as positive. 
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Example 9 - Delivery of Transgene to Host Cell 

The resulting recombinant chimpanzee adenovirus described in Example 4, 5 or 6 
above is then employed to deliver the transgene to a mammalian, preferably human, cell. 
For example, following purification of the recombinant virus, human embryonic kidney 
5 293 cells are infected at an MOI of 50 particles per cell. GFP expression was 
documented 24 hours post-infection. 

A. Gene Transfer in Mouse Models via Pan-6, Pan-7, and Pan-9 vectors 
Gene transfer efficiencies and toxicological profile of recombinant 

chimpanzee adenoviruses were compared in mouse liver directed gene transfer, mouse 

10 lung directed gene transfer, and mouse muscle directed gene transfer. 

El -deleted adenoviral vectors containing LacZ under the control of the 
CMV promoter were constructed using the techniques herein for human Ad5, chimpanzee 
Pan 6, chimpanzee Pan 7 and chimpanzee Pan 9 (C68). The vectors were delivered to 
immune-deficient NCR nude mice (80 for each study) as follows. For the liver study, 100 

15 ^1 (1 xlO 11 particles) were injected into the tail vein. For the lung study, 50 \il (5 xlO 10 
particles) were delivered intratracheally. For the muscle study, 25 \il (5 xl0 10 particles) 
were injected into tibialis anterior. The mice were sacrificed on days 3, 7, 14 and 28 post- 
vector injection (5 animals per group at each time point). At each necropsy, the 
liver/lung/muscle tissue was harvested and prepared for cryoblocks and paraffin 

20 embedding. The cryoblocks were sectioned for X-gal staining and the paraffin sections 
are H&E stained for histopathic analysis. At each time point, terminal bleeding was 
performed. Serum samples were subjected to liver function tests. 

It was observed in this experiment the chimpanzee adenoviruses Pan-6, Pan-7, 
and Pan-9 were less efficient than huAd5 in gene transfer to the liver and to the lung. 

25 However, this may be desirable in certain circumstances, to reduce liver toxicity observed 
for huAd5. The gene transfer efficiency in muscle varied less between serotypes. 

B. Mouse study to feasibility of re-administration of adenovirus vectors by 
serotype switching between Adhu 5, Pan-6, Pan-7, and Pan-9 vectors 

Mice were administered (C57/B16; 4/group) LacZ vectors based on 
30 huAd5, Pan-6, Pan-7, and Pan-9 (H5.040CMVLacZ, Pan6.000CMVLacZ, 

Pan7.000CMVLacZ, Pan9.000CMVLacZ; 10 H particles/injection) by tail vein. Thirty 
days later the mice were re-administered adenovirus vectors expressing a 1 -antitrypsin 
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(H5.040CMVhAlAT, Pan6.000CMVhAlAT, 1x10" particles, Pan7.000CMVhAlAT, 
Pan9.000CMVhAlAT, 10 1 'particles/injection). Successful transduction by there- 
administered vector is monitored by measuring serum al -antitrypsin on days 3 and 7, 
following re-administration. 

The ability of adenovirus vectors based on huAd5, Pan-6, Pan-7, and Pan- 
9 respectively to transduce the livers of mice in the presence of neutralizing antibodies to 
the other serotypes was determined. The results are tabulated here. 



1 st iniection 

4. ill) WLIUl 1 


2 nd iniertinn 


^ross-ncutraiizaiion 


A JL n f 

Adnuj 


Adhu5 


Yes (+ve control) 




Pan-6 


No 




ran-/ 


No 




ran-y (Coo) 


No 


r <ui~u 


/\anu3 


NO 




Don C 

ran-o 


Yes (+ve control) 




Pan-7 - 


Yes 




Pan-9 (C68) 


No 


Pan-7 


Adhu5 


No 




Pan-6 


Yes 




Pan-7 


Yes (+ve control) 




Pan-9 (C68) 


Yes 



71 



WO 03/046124 



PCT/US02/33645 



1 St • ■ j_* 

1 injection 


2 nd injection 


Cross-neutralization 


Pan-9 (C68) 


Adhu5 


No 




Pan-6 


No 




Pan-7 


Yes 




Pan-9 (C68) 


Yes (+ve control) 



Ability of vectors to transduce murine liver in the presence of neutralizing 
antibodies to other serotypes. 
5 Thus, immunization with huAd5 does not prevent re-administration with either of 

the chimpanzee adenovirus vectors Pan-6, Pan-7, or Pan-9 (C68). This experiment also 
appears to indicate that Pan-7 is between Pan-6 and Pan-9 in the spectrum of antigenic 
relatedness and cross-reacts with both; however Pan-6 and Pan-9 do not neutralize each 
other. This is a surprising result based on homology comparisons, which indicates that 
10 Pan-6 is quite distinct from Pan-7 and Pan-9. Evaluation of antisera generated against 

Pan-9 indicated no cross-neutralization against Pan-6 but some neutralization against Pan- 
7, arguing that Pan-6 is distinct from the others. 

1 5 Example 1 0 - Generation of Recombinant El -Deleted S V-25 Vector 

A plasmid was constructed containing the complete SV-25 genome except for an 
engineered El deletion. At the site of the El deletion recognition sites for the restriction 
enzymes I-Ceul and Pl-Scel which would allow insertion of transgene from a shuttle 
plasmid where the transgene expression cassette is flanked by these two enzyme 

20 recognition sites were inserted. 

A synthetic linker containing the restriction sites Swal-SnaBI-Spel-Aflll-EcoRV- 
Swal was cloned into pBR322 that was cut with EcoRI and Ndel. This was done by 
annealing together two synthetic oligomers SV25T (5'-AAT TTA A AT ACG TAG CGC 
ACT AGT CGC GCT AAG CGC GGA TAT CAT TTA AA-3 \ SEQ ID NO: 38) and 

25 SV25B (5>-TAT TTA AAT GAT ATC CGC GCT TAA GCG CGA CTA GTG CGC 



WO 03/046124 



PCT/US02/33645 



TAC GTA TTT A-3', SEQ ID NO:39) and inserting it into pBR322 digested with EcoRI 
and Ndel. The left end (bpl to 1057, SEQ ID NO:29) of Ad SV25 was cloned into the 
above linker between the SnaBI and Spel sites. The right end (bp28059 to 3 1042, SEQ 
ID NO: 29) of Ad SV25 was cloned into the linker between the Aflll and EcoRV sites. 
5 The adenovirus El was then excised between the EcoRI site (bp 547) to Xhol (bp 203 1) 
from the cloned left end as follows. A PCR generated 1-CeuI-PI-SceI cassette from 
pShuttle (Clontech) was inserted between the EcoRI and Spel sites. The 10154 bp Xhol 
fragment of Ad SV-25 (bp2031 to 12185, SEQ ID NO:29) was then inserted into the Spel 
site. The resulting plasmid was digested with Hindlll and the construct (pSV25) was 

10 completed by inserting the 18344 bp Ad SV-25 Hindlll fragment (bpl 1984 to 30328, 
SEQ ID NO:29) to generate a complete molecular clone of El deleted adenovirus SV25 
suitable for the generation of recombinant adenoviruses. Optionally, a desired transgene 
is inserted into the I-Ceul and Pl-Scel sites of the newly created pSV25 vector plasmid. 
To generate an AdSV25 carrying a marker gene, a GFP (green fluorescent 

15 protein) expression cassette previously cloned in the plasmid pShuttle (Clontech) was 
excised with the restriction enzymes I-Ceul and Pl-Scel and ligated into pSV25 (or 
another of the Ad chimp plasmids described herein) digested with the same enzymes. 
The resulting plasmid (pSV25GFP) was digested with Swal to separate the bacterial 
plasmid backbone and transfected into the El complementing cell line HEK 293. About 

20 10 days later, a cytopathic effect was observed indicating the presence of replicative 

virus. The successful generation of an Ad SV25 based adenoviral vector expressing GFP 
was confirmed by applying the supernatant from the transfected culture on to fresh cell 
cultures. The presence of secondarily infected cells was determined by observation of 
green fluorescence in a population of the cells. 

25 

Example 1 1 - Construction of E3 deleted Pan-5, Pan-6, Pan-7 and C68 vectors 

In order to enhance the cloning capacity of the adenoviral vectors, the E3 region 
can be deleted because this region encodes genes that are not required for the propagation 
of the virus in culture. Towards this end, E3-deleted versions of Pan-5, Pan-6, Pan-7, and 
30 C68 have been made (a 3.5 kb Nru-Avrll fragment containing E3 1-9 is deleted). 
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A. E3 deleted Pan5 based vector 

El-deleted pPan5-pkGFP plasmid was treated with Avr II endonuclease to 
isolate a 5.8 kb fragment containing the E3 region and re-circulate pPan5-pkGFP with 
Avr II deletion to form construct pPan5-pkGFP-E3-Avr II. Subsequently, the 5.8 kb Avr 
5 H fragment was subcloned into P SL-Pan5-E3-Avr II for a further deletion of E3 region by 
Nru I digestion. This led to a plasmid P SL-Pan5-E3-deIetion. The final construct P Pan5- 
E3-pkGFP was produced by removing a 4.3 kb Avr II/Spe I fragment from P SL-Pan5-E3- 
deletion plasmid and inserting into pPan5-pkGFP-E3-Avr II at Avr U site. In this final 
construct, a 3.1 kb deletion in E3 region was accomplished. 
10 B. E3 deletion in Pan6 based vector 

El-deleted pPan6- pkGFP molecular clone was digested with Sbf I and 
Not I to isolate 19.3 kb fragment and ligated back at Sbf I site. The resulting construct 
pPan6-Sbf I-E3 was treated with Eco 47 III and Swa I, generating pPan6-E3. Finally, 21 
kb Sbf I fragment from Sbf I digestion of P Pan6- pkGFP was subcloned into P Pan6-E3 to 
15 create pPan6-E3 -pkGFP with a 4 kb deletion in E3. 

C. E3 deleted Pan7 and Pan9 vectors 

The same strategy was used to achieve E3 deletions in both vectors. First, 
a 5.8 kb Avr II fragment spanning the E3 region was subcloned pSL-1 180, followed by 
deletion of E3 by Nru I digestion. The resulting plasmids were treated with Spe I and Avr 
20 II to obtain 4.4 kb fragments and clone into pPan7- pkGFP and pPan9-pkGFP at Avr II 
sites to replace the original E3 containing Avr II fragments, respectively. The final pPan7- 
E3- pkGFP and pPan9-E3- pkGFP constructs have 3.5 kb E3-deletions. 

Example 12 - Construction of E3- and E4-deleted Pan-7 vector 
2 5 Although the deletion of the E 1 region of adenoviruses (first generation 

adenovirus vectors) renders them replication-incompetent, expression of the adenoviral 
vector backbone genes is not fully abolished. Deletion of the E4 region considerably 
attenuates this residual gene expression and may confer a safety advantage. An E4- 
deleted Pan-7 vector containing a 2.5 kb deletion (a PvuH-Agel fragment containing 
30 E40RF1 -ORF7 is deleted) has been constructed. High titer stocks of this virus were 
generated using a HEK 293-based cell line, which in addition to El, expresses an 
essential E4 gene (orf 6). 
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1 . E4 deletion in the molecular clone of Pan7 

A 19 kb Xba I fragment was deleted from pPan7- pkGFP to create pPan7- 
Xba I from which a 2.5 kb E4 fragment was deleted by Age I and Pvu II partial digestion, 
resulting in pPan7-Xba I-E4. pPan7-E4- pkGFP plasmid was generated from pPan7-Xba 
5 I-E4 in two sequential cloning steps, adding 1 9 kb Xba I and 1 5 kb I-Ceu I/Mlu 

I fragments, both of which came from pPan7- pkGFP construct. 

2. Introduction of E3 and E4 deletions in Pan9 vector 

A 1 1 kb plasmid, pPan9-EcoRI, containing E4 region was created by 
retrieving 1 1 kb EcoRI fragment from pPan9 pkGFP after EcoRI digestion and self- 
10 ligation. E4 region was deleted from this construct by Age I digestion/filled in and Pvu 

II partial digestion and slef-ligation to generate pPan9-EcoR I-E4. A 23 kb 
EcoR I fragment was isolated from pPan9-pkGFP and inserted into pPan9-EcoR 
I-E4 at EcoR I site, followed by adding 5.8 kb Avr II fragment from 
pPan9-pkGFP, to form the final product pPan9-E3-E4- pkGF. Compared to the 

1 5 genome size of wild type Pan9, this El -E3-E4-deleted vector could have a 
transgene capacity up to 8 kb. 

3. Introduction ofminigene cassettes with genes of interest including 
reporter genes, glyco- and nuclear proteins of Ebo into molecular clones of Pan 
vectors 

20 A highly efficient direct cloning and green/white selection procedure was 

employed for creating molecular clones of recombinant viruses. Briefly, genes of interest 
were cloned into pShuttlepkGFP by screening white colonies for recombinants. 
Subsequently, the minigene cassettes were transferred into chimpanzee 
adenovirus backbone plasmids, pPanX-pkGFP with various deletions, easily by 

2 5 swapping with pkGFP cassette at I-Ceu I and Pl-Sce I sites and screening a 
few white colonies for correct recombinants. 

4. Rescue of molecular clones of Pan vectors with multiple deletions in 
early regions and virus propagation 

For rescue of El-E3-deleted molecular clones of chimpanzee adenovirus 
30 vectors, the clones were linearized with appropriate restriction enzymes and transfected 
into regular 293 cells. Once a full cytopathic effect (CPE) observed in the transfected 
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cells, crude lysate was harvested and expanded in 293 cells to large-scale infections. The 
viruses were purified by CsCl sedimentation method. 

For E1-E4 and El -E3-E4-deleted Pan vectors, 1 0-3 cells, a 293-based 
El-E4-complementing cell line, were used for rescue and propagation of vectors. E4 
5 ORF6 gene expression in 10-3 cells was induced by addition of 150 \iM ZnSO 4 to the 
culture medium. 

Example 13 - Vaccination with adenovirus vectors expressing wild type and variant 
EboZGP. 

10 AdHu5 or AdC7 vectors expressing Ebola envelope chimeras were 

produced for in vivo immunization experiments in C57BL/6 mice. Recombinant 
viruses with different viral backbones were created by molecular cloning method 
in which the minigene cassettes were inserted into the place of El -deletions. The 
molecular clones of all recombinant viruses were rescued and grown up in 293 

15 cells for large-scale purification using CsCl sedimentation method. 

Five EboZ variants encoded by AdHu5 or AdPan7 (C7) were selected and 
produced to evaluate their relative immunogenicity following an intramuscular 
Ad injection. The wt Ebo, a soluble Ebo variant, EboAl, EboA2, EboA3, EboA4, 
EboASS, EboA6S, EboA7S and EboA8S were evaluated in the initial vaccine 

20 studies. For the data summarized in the following table, the number of viral 
particles (per mi or total) produced and amplified from infected 293 cells was 
established by spectrophotometry reading. 
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Table: Production of AdhuS or AdC7 Adenoviral vector encoding EboZ variant. 





HuAd5 




AdC7 




Gene 


Titer 
(VP x 
10 l2 /ml) 


Total 
yield 
(VPx 
10 12 ) 


Titer 

(VPx 

10 ,2 /ml) 


Total 
yield 
(VPx 
10 12 ) 


Ebo wt 


2.6 


12 


4.3 


43 


EboS 


4.9 


49 


4.6 


55 


EboA2 


2.1 


9 


5.8 


93 


EboA3 


1.7 


8 


5.3 


95 


EboA4 


3 


12 


4.1 


62 



Vector was administered intramuscularly (10 n genome copies/cell) in C57BL/6 
5 mice and the presence of virus neutralizing antibody (VNAO was evaluated 28 days later 
as a first measure of an immune response generated against the Ebola envelope 
glycoprotein. VNA is defined here as serum antibody able to inhibit transduction of HeLa 
cells mediated by HIV-based vector pseudotyped with the wild-type Ebola envelope. 

VNA to the EboZ pseudotypes was detected with AdPan7 (C7) yielding higher 
10 titers than AdHu5. The EboZA3 elicited the highest VNA in terms of the transgene 
targets. For the data summarized in the following table, neutralizing antibody titers to 
HIV-EboZ-GFP pseudotypes (reciprocal dilution) are provided (N=5 animals/group). 

VNA Titers 

EboZ wildtype EboZs EboZA3 

AdHu5 12 16 12 

AdC7 44 12 140 
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Example 14 - Pan7-mediated Expression of Ebola Proteins 

Mouse studies to evaluate Pan-7 vectors expressing Ebola envelope proteins and 
the Ebola nuclear antigen have been initiated. These are directed towards evaluation of 
neutralizing antibodies in C57B1/6 mice injected intramuscularly (IM) with AdhuS or 
5 Pan-7 expressing each of 4 Ebola env constructs. 

A. Evaluation ofCTLfrom C57BI/6 mice injected IMwith AdhuS or Pan- 
7 expressing the Ebola env consfructs. 

1 . Challenge experiment in mice with Ebola virus. 

Neutralizing antibody (NAB) responses to the Ebola envelope 

10 were analyzed by looking at immunized mouse sera mediated neutralization of a lentiviral 
(HIV) vector pseudotyped with the several constructs (eEbo, NTD2, NTD3, NTD4) of the 
Ebola envelope glycoprotein. C57BL/6 or BALB/c mice received a single intramuscular 
injection of 5 x 10 10 particles per mouse of C7 (Ad Pan-7) encoding Ebola envelope 
variant. Neutralizing antibody was evaluated 30 days post-vaccination. Briefly, Ebola 

15 Zaire pseudotyped HIV vector encoding for B-galactosidase (EboZ-HIV-LacZ) was 
incubated for 2 hr at 37°C with different dilution of heat inactivated mouse serum.. 
Following the incubation with serum, EboZ-HIV-LacZ was then used to infect HeLa cells 
for 16 hr at 37°C. Infectivity was revealed by X-gal staining of transduced HeLa cells 
positive for 6-galactosidase. Neutralizing titer represent the serum reciprocal dilution 

20 where a 50% decrease in the number of B-galactosidase positive blue cells was observed. 
Sera were collected 30 days post-immunization, which consisted in a single intramuscular 
(I.M.) administration of 5 x 10 10 particles/animal. Neutralizing antibody to Ebola 
pseudotyped HIV could be detected from all groups with antibody titers ranging from 20 
for Ad-EboZ (Adhu5 expressing EboZ), Ad-NTD3 (AdhuS expressing NTD3) and C7- 

25 sEbo (Ad Pan-7 expressing soluble EboZ) to over 130 for C7-NTD3 (Ad Pan-7 

expressing soluble NTD3) and C7-NTD4 (Ad Pan-7 expressing soluble NTD3). The same 
immunization strategy in BALB/c mice resulted in lower neutralizing antibody titers for 
Ad- and C7-NTD2, and NTD4. 

B. Cellular Immune Response 

30 The cellular immune response to the Ebola envelope in C57BL/6 mice 

was evaluated 8 days after a single I.M. administration of 5 x 10 10 particles of C7-LacZ or 
C7-Ebola envelope variant per animal. Mice were vaccinated I.M. with 5 x 10 10 particles 



WO 03/046124 



PCT/US02/33645 



of C7 encoding LacZ or Ebola envelope variant. Splenic lymphocytes from immunized 
mice were collected 8 days post vaccination and stimulated in vitro with feeder cells 
(splenic lymphocytes from untreated mice infected with human Adenovirus serotype 5 
encoding for the wild-type Ebola envelope and irradiated). Standard 5-hr CTL assays 
5 were performed using 51 Cr-labeled syngenic C57 cells transfected with an expressor of 
EboZ. 

A positive MHC-restricted cytotoxic T lymphocyte (CTL) response was observed 
from all AdPan-7 encoding for Ebola envelope variants with a higher response from 
NTD2, NTD3 or NTD4 immunized mice. Indeed, effector cells from C7 encoding Ebola 

10 envelope variant immunized mice recognized EboZ transfected target cells and gave 

recall CTL responses up to 30% specific lysis. Less than 5% lysis was seen with effector 
cells from naive or LacZ immunized control mice confirming that lysis was specific for 
Ebola envelope antigens. 

C Protection Studies 

15 The most direct means of evaluating C7 (Ad Pan-7) encoding for the 

EboZ variants as a successful vaccine in mice was to assess protection against weight loss 
and death following lethal challenge with mouse adapted Ebola Zaire virus. BALB/c mice 
were immunized with a single dose of 5 x 10 10 particles per animal as performed 
previously and vaccinated animals were challenged with 200 LD50 of mouse adapted 

20 Ebola Zaire 21 days later. All control mice (vehicle and C7-LacZ) died between day 5 to 
day 9 post-challenge. In contrast, all vaccinated mice but one, (from the C7-sEbo group), 
survived the challenge with Ebola Zaire. 

Weight loss was observed from mice vaccinated with C7-sEbo from day 4 to day 
7. Signs of illness such as pilo-erection and from light to severe lethargy were also noted 

25 from mice vaccinated with C7-sEbo, NTD2 and NTD3 between day 4 to day 7. Mice 
immunized with C7-EboZ and C7-NTD4 did not show sign of illness. Overall, a single 
dose of C7-EboZ and C7-NTD4 completely protected immunized mice from illness and 
death possibly due to a significant T cell mediated immunity. 
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All documents recited above are incorporated herein by reference. Numerous 
modifications and variations of the present invention are included in the scope of the 
above-identified specification and are expected to be obvious to one of skill in the art. 
5 Such modifications and alterations to the compositions and processes of the present 
invention, such as selections of different minigenes or selection or dosage of the 
vectors or immune modulators are believed to be within the scope of the claims 
appended hereto. 
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WHAT IS CLAIMED IS: 

1. An isolated simian adenovirus nucleic acid sequence selected from the 
group consisting of: 

(a) Pan5 having the sequence of nucleic acids 1 to 36462 of SEQ ID 

NO:l; 

(b) Pan6 having the sequence of nucleic acids 1 to 36604 of SEQ ID 

NO:5; 

(c) Pan7 having the sequence of nucleic acids 1 to 36535 of SEQ ID 

NO:9; 

(d) SV1 having the sequence of nucleic acids 1 to 34264 of SEQ ID NO: 

24; 

(e) SV25 having the sequence of nucleic acids I to 31044 of SEQ ID NO: 

29; 

(f) SV39 having the sequence of to nucleic acids 1 to 341 1 5 of SEQ ID 

NO: 34, and 

(g) a nucleic acid sequence complementary to the sequence of any of (a) 

to(f). 

2. An isolated simian adenovirus serotype nucleic acid sequence selected 
from one or more of the group consisting of: 

(a) 5' inverted terminal repeat (ITR) sequences; 

(b) the adenovirus El a region, or a fragment thereof selected 
from among the 13S, 12S and 9S regions; 

(c) the adenovirus Elb region, or a fragment thereof selected 
from among the group consisting of the small T, large T, DC, and IVa2 regions; 

(d) the E2b region; 

(e) the LI region, or a fragment thereof selected from among 
the group consisting of the 28.1 kD protein, polymerase, agnoprotein, 52/55 kD 
protein, and Ufa protein; 
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(f) the L2 region, or a fragment thereof selected from the group 
consisting of the penton, VII, VI, and Mu proteins; 

(g) the L3 region, or a fragment thereof selected from the group 
consisting of the VI, hexon, or endoprotease; 

(h) the 2a protein; 

(i) the L4 region, or a fragment thereof selected from the group 
consisting of the 100 kD protein, the 33 kD homolog, and VIE; 

(j) the E3 region, or a fragment thereof selected from the group 
consisting of E3 ORF1, E3 ORF2, E3 ORF3, E3 ORF4, E3 ORF5, E3 ORF6, E3 
ORF7, E3 ORF8, and E3 ORF9; 

(k) the L5 region, or a fragment thereof selected from a fiber 

protein; 

(1) the E4 region, or a fragment thereof selected from the group 
consisting of E4 ORP7, E4 ORF6, E4 ORF4, E4 ORF3, E4 ORF2, and E4 ORFl;and 

(m) the 3' ITR, 

of any of Pan5 SEQ ID NO:l; Pan6 SEQ ID NO:5; Pan7 SEQ ID NO:9; 
SV1 SEQ ID NO: 24; SV25 SEQ ID NO: 29; and SV39 SEQ ID NO: 34, or sequence 
complementary to any of (a) to (m). 

3. A simian adenovirus protein encoded by the nucleic acid sequence 
according to claim 2. 

4. A nucleic acid molecule comprising a heterologous simian adenoviral 
sequence according to claim 2. 

5. The nucleic acid molecule according to claim 4, wherein said simian 
adenoviral sequence encodes an adenoviral gene product and is operatively linked to 
regulatory control sequences which direct expression of the adenoviral gene product in a 
host cells. 
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6. The nucleic acid molecule according to claim 4 or 5, wherein said simian 
adenoviral sequence comprises the El a region of Pan5 SEQ ID NO:l; Pan6 SEQ ID 
NO:5; Pan7 SEQ ID NO:9; SV1 SEQ ID NO: 24; SV25 SEQ ID NO: 29; and SV39 SEQ 
ID NO: 34. 

7. A pharmaceutical composition comprising the nucleic acid molecule 
according to claim 6 and a physiologically compatible carrier. 

8. An isolated simian adenoviral capsid protein selected from the group 
consisting of: 

(a) a hexon protein of Pan5 SEQ ID NO:3, Pan6 SEQ ID NO:7, 
Pan7 SEQ ID NO:ll, SV1 SEQ ED NO:26, SV25 SEQ ID NO:31or SV39 SEQ ID 
NO:36, or fragment thereof; 

(b) a penton protein of Pan5 SEQ ID NO:2, Pan6 SEQ ID 
NO:6, Pan7 SEQ ID NO:10, SV1 SEQ ID NO:25, SV25 SEQ ID NO: 30 or SV39 
SEQIDNO:35; 

(c) a fiber protein of Pan5 SEQ ED NO:4, Pan6 SEQ ED NO:8, 
Pan7 SEQ ED NO: 12, SV1 SEQ ED NO: 27 and SEQ ID NO:28, SV25 SEQ ID NO: 
32 and SEQ ED NO:33 or SV39 SEQ ED NO: 37, or a fragment thereof. 

9. An artificial adenovirus serotype comprising a capsid protein 
according to claim 8 or a fragment thereof. 

10. The artificial adenovirus serotype according to claim 9, wherein said 
capsid comprises a fragment of the hexon protein selected from the group consisting 
of Pan5 SEQ ED NO: 15, Pan6 SEQ ID NO: 16 and Pan7 SEQ ED NO: 17. 

1 1 . The artificial adenovirus serotype according to claim 9 or 1 0, wherein 
said capsid comprises a fragment of the fiber protein selected from the group 
consisting of Pan6 SEQ ED NO: 19, Pan7 SEQ ED NO:20 and Pan5 SEQ ED NO:21. 
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12. A nucleic acid molecule comprising a heterologous sequence encoding 
a protein according to claims 3 or 8 or an artificial adenovirus serotype according to 
any of claims 9 to 11. 



13. A recombinant vector comprising a simian adenovirus sequence 
according to claim 2 or a nucleic acid molecule according to claim 4 or 12 and a 
heterologous gene operatively linked to sequences which direct expression of said 
gene in a host cell. 



14. The recombinant vector according to claim 13, further comprising 5' 
and 3' adenovirus cis-elements necessary for replication and encapsidation. 

15. The recombinant vector according to claim 1 3 or claim 1 4, wherein 
said vector is a virus. 



1 6. The recombinant vector according to any of claims 1 3 to 1 5 wherein 
said vector lacks all or a part of the El gene. 

17. A recombinant virus comprising a simian capsid protein according to 
. claim 3 or an artificial adenovirus serotype according to any of claims 9 to 1 1. 

18. A host cell comprising a nucleic acid molecule according to any of 
claims 4 to 6 or 12 , a recombinant vector according to any of claims 13 to 16, or a 
recombinant virus according to claim 17. 

1 9. The host cell according to claim 1 8, wherein said host cell is stably 
transformed with the nucleic acid molecule or the recombinant vector. 
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20. The host cell according to claim 1 8 or claim 19, wherein said host cell 
expresses one or more adenoviral gene products from said nucleic acid molecule or 
recombinant vector, said adenoviral gene products selected from the group consisting 
of El a, Elb, E2a, and E4 ORF6. 

21 . The host cell according to any of claims 1 8 to 20, wherein said host 
cell is stably transformed with a nucleic acid molecule comprising the simian 
adenovirus inverted terminal repeats. 

22. A composition comprising a recombinant vector in a pharmaceutically 
acceptable carrier, said vector comprising a simian adenovirus sequence according to 
any of claims 1 or 2 and a selected heterologous gene operatively linked to regulatory 
sequences which direct expression of said gene in a host cell. 

23. A method for delivering a heterologous gene to a mammalian cell 
comprising introducing into said cell an effective amount of the vector of any of 
claims 13 to 16 or a virus according to claim 17. 

24. A method for repeat administration of a heterologous gene to a 
mammal comprising the steps of: 

(a) introducing into said mammal a first vector which 
comprises the heterologous gene and 

(b) introducing into said mammal a second vector which 
comprises the heterologous gene; 

wherein at least the first virus or the second vector is a virus according 
to claim 17 and wherein the first and second recombinant vector are different. 

25 . A method for producing a selected gene product comprising infecting a 
mammalian cell with the vector of any of claims 13 to 16 or a virus according to 
claim 17, culturing said cell under suitable conditions and recovering from said cell 
culture the expressed gene product. 
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26. A method for eliciting an immune response in a mammalian host 
against an infective agent comprising administering to said host an effective amount 
of the recombinant adenovirus of claim 17, wherein said heterologous gene encodes 
an antigen of the infective agent. 

27. The method according to claim 26, comprising the step of priming the 
host with a DNA vaccine comprising the heterologous gene prior to administering the 
recombinant adenovirus . 

28. A composition comprising a simian adenovirus capsid protein 
according to claim 8 linked to a heterologous molecule for delivery to a selected host 
cell. 

29. A method for targeting a cell having an adenoviral receptor comprising 
delivering to a subject a composition according to claim 28. 
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SEQUENCE LISTING 

<110> The Trustees of the University of Pennsylvania 
Wilson, James M. 
Gao , Guangping 
Roy, Soumitra 

<120> Simian Adenovirus Nucleic Acid and Amino Acid Sequences, 
Vectors Containing Same, and Methods of Use 

<130> UPN-02677PCT 

<150> US 60/331,951 
<151> 2001-11-21 

<150> US 60/366,798 
<151> 2002-03-22 

<160> 39 

<170> Patentln version 3.1 

<210> 1 

<211> 36462 

<212> DNA 

<213> chimpanzee adenovirus serotype Pan5 

<220> 

<221> CDS 

<222> (13898) . . (15490) 

<223> L2 Penton 

<220> 

<221> CDS 

<222> (18315) . . (21116) 

<223> L3 Hexon 

<220> 

<221> CDS 

<222> (32035) (33372) 

<223> L5 Fiber 

<400> 1 



catcatcaat 


aatatacctc aaacttttgg 


tgcgcgttaa tatgcaaatg aggtatttga 


60 


atttggggat 


gcggggcggt gattggctgc 


gggagcggcg accgttaggg gcggggcggg 


120 


tgacgttttg 


atgacgtggc cgtgaggcgg 


agccggtttg caagttctcg tgggaaaagt 


180 


gacgtcaaac 


gaggtgtggt ttgaacacgg 


aaatactcaa ttttcccgcg ctctctgaca 


240 


ggaaatgagg 


tgtttctggg cggatgcaag 


tgaaaacggg ccattttcgc gcgaaaactg 


300 


aatgaggaag 


tgaaaatctg agtaattccg 


cgtttatggc agggaggagt atttgccgag 


360 


ggccgagtag 


actttgaccg attacgtggg 


ggtttcgatt accgtatttt tcacctaaat 


420 
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ttccgcgtac ggtgtcaaag tccggtgttt ttacgtaggt gtcagctgat cgccagggta 480 

tttaaacctg cgctctctag tcaagaggcc actcttgagt gccagcgagt agagttttct 540 

cctccgcgcc gcgagtcaga tctacacttt gaaagatgag gcacctgaga gacctgcccg 600 

gtaatgtttt cctggctact gggaacgaga ttctggaact ggtggtggac gccatgatgg 660 

gtgacgaccc tccggagccc cctaccccat ttgaagcgcc ttcgctgtac gatttgtatg 720 

atctggaggt ggatgtgccc gagaacgacc ccaacgagga ggcggtgaat gatttgttta 780 

gcgatgccgc gctgctggct gccgagcagg ctaatacgga ctctggctca gacagcgatt 840 

cctctctcca taccccgaga cccggcagag gtgagaaaaa gatccccgag cttaaagggg 900 

aagagctcga cctgcgctgc tatgaggaat gcttgcctcc gagcgatgat gaggaggacg 960 

aggaggcgat tcgagctgca gcgaaccagg gagtgaaaac agcgagcgag ggctttagcc 1020 

tggactgtcc tactctgccc ggacacggct gtaagtcttg tgaatttcat cgcatgaata 1080 

ctggagataa gaatgtgatg tgtgccctgt gctatatgag agcttacaac cattgtgttt 1140 

acagtaagtg tgattaactt tagctgggga ggcagagggt gactgggtgc tgactggttt 1200 

atttatgtat atgtttttta tgtgtaggtc ccgtctctga cgtagatgag acccccacta 1260 

cagagtgcat ttcatcaccc ccagaaattg gcgaggaacc gcccgaagat attattca'ta 1320 

gaccagttgc agtgagagtc accgggcgta gagcagctgt ggagagtttg gatgacttgc 1380 

tacagggtgg ggatgaacct ttggacttgt gtacccggaa acgccccagg cactaagtgc 1440 

cacacatgtg tgtttactta aggtgatgtc agtatttata gggtgtggag tgcaataaaa 1500 

tccgtgttga ctttaagtgc gtggtttatg actcaggggt ggggactgtg ggtatataag 1560 

caggtgcaga cctgtgtggt cagttcagag caggactcat ggagatctgg acagtcttgg 1620 

aagactttca ccagactaga cagctgctag agaactcatc ggagggagtc tcttacctgt 1680 

ggagattctg cttcggtggg cctctagcta agctagtcta tagggccaag caggattata 1740 

aggatcaatt tgaggatatt ttgagagagt gtcctggtat ttttgactct ctcaacttgg 1800 

gccatcagtc tcactttaac cagagtattc tgagagccct tgacttttct actcctggca 1860 

gaactaccgc cgcggtagcc ttttttgcct ttatccttga caaatggagt caagaaaccc 1920 

atttcagcag ggattaccgt ctggactgct tagcagtagc tttgtggaga acatggaggt 1980 

gccagcgcct gaatgcaatc tccggctact tgccagtaca gccggtagac acgctgagga 2040 

tcctgagtct ccagtcaccc caggaacacc aacgccgcca gcagccgcag caggagcagc 2100 
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agcaagagga ggaccgagaa gagaacctga gagccggtct ggaccctccg gtggcggagg 2160 

aggaggagta gctgacttgt ttcccgagct gcgccgggtg ctgactaggt cttccagtgg 2220 

acgggagagg gggattaagc gggagaggca tgaggagact agccacagaa ctgaactgac 2280 

tgtcagtctg atgagtcgca ggcgcccaga atcggtgtgg tggcatgagg tgcagtcgca 2340 

ggggatagat gaggtctcag tgatgcatga gaaatattcc ctagaacaag tcaagacttg 2400 

ttggttggag cccgaggatg attgggaggt agccatcagg aattatgcca agctggctct 2460 

gaggccagac aagaagtaca agattaccaa actgattaat atcagaaatt cctgctacat 2520 

ttcagggaat ggggccgagg tggagatcag tacccaggag agggtggcct tcagatgctg 2580 

catgatgaat atgtacccgg gggtggtggg catggaggga gtcaccttta tgaacgcgag 2640 

gttcaggggt gatgggtata atggggtggt ctttatggcc aacaccaagc tgacagtgca 2700 

cggatgctcc ttctttggct tcaataacat gtgcattgag gcctggggca gtgtttcagt 2760 

gaggggatgc agtttttcag ccaactggat gggggtcgtg ggcagaacca agagcatggt 2820 

gtcagtgaag aaatgcctgt tcgagaggtg ccacctgggg gtgatgagcg agggcgaagc 2880 

caaagtcaaa cactgcgcct ctaccgagac gggctgcttt gtactgatca agggcaatgc 2940 

caaagtcaag cataatatga tctgtggggc ctcggatgag cgcggctacc agatgctgac 3000 

ctgcgccggt gggaacagcc atatgctagc caccgtgcat gtggcctcgc acccccgcaa 3060 

gacatggccc gagttcgagc acaacgtcat gacccgctgc aatgtgcacc tggggtcccg 3120 

ccgaggcatg ttcatgccct accagtgcaa catgcaattt gtgaaggtgc tgctggagcc 3180 

cgatgccatg tccagagtga gcctgacggg ggtgtttgac atgaatgtgg agctgtggaa 3240 

aattctgaga tatgatgaat ccaagaccag gtgccgggcc tgcgaatgcg gaggcaagca 3300 

cgccaggctt cagcccgtgt gtgtggaggt gacggaggac ctgcgacccg atcatttggt 3360 

gttgtcctgc aacgggacgg agttcggctc cagcggggaa gaatctgact agagtgagta 3420 

gtgtttggga ctgggtggga gcctgcatga tgggcagaat gactaaaatc tgtgtttttc 3480 

tgcgcagcag catgagcgga agcgcctcct ttgagggagg ggtattcagc ccttatctga 3540 

cggggcgtct cccctcctgg gcgggagtgc gtcagaatgt gatgggatcc acggtggacg 3600 

gccggcccgt gcagcccgcg aactcttcaa ccctgaccta cgcgaccctg agctcctcgt 3660 

ccgtggacgc agctgccgcc gcagctgctg cttccgccgc cagcgccgtg cgcggaatgg 3720 

ccctgggcgc cggctactac agctctctgg tggccaactc gagttccacc aataatcccg 3780 

ccagcctgaa cgaggagaag ctgctgctgc tgatggccca gctcgaggcc ctgacccagc 3840 



3 



WO 03/046124 



PCT/US02/33645 



gcctgggcga 


gctgacccag 


caggtggctc 


agctgcaggc 


ggagacgcgg 


gccgcggttg 


3900 


ccacggtgaa 


aaccaaataa 


aaaatgaatc 


aataaataaa 


cggagacggt 


tgttgatttt 


3960 


aacacagagt 


cttgaatctt 


tatttgattt 


ttcgcgcgcg 


gtaggccctg 


gaccaccggt 


4020 


ctcgatcatt 


gagcacccgg 


tggatctttt 


ccaggacccg 


gtagaggtgg 


gcttggatgt 


4080 


tgaggtacat 


gggcatgagc 


ccgtcccggg 


ggtggaggta 


gctccattgc 


agggcctcgt 


4140 


gctcgggggt 


ggtgttgtaa 


atcacccagt 


catagcaggg 


gcgcagggcg 


tggtgctgca 


4200 


cgatgtcctt 


gaggaggaga 


ctgatggcca 


cgggcagccc 


cttggtgtag 


gtgttgacga 


4260 


acctgttgag 


ctgggaggga 


tgcatgcggg 


gggagatgag 


atgcatcttg 


gcctggatct 


4320 


tgagattggc 


gatgttcccg 


cccagatccc 


gccgggggtt 


catgttgtgc 


aggaccacca 


4380 


gcacggtgta 


tccggtgcac 


ttggggaatt 


tgtcatgcaa 


cttggaaggg 


aaggcgtgaa 


4440 


agaatttgga 


gacgcccttg 


tgaccgccca 


ggttttccat 


gcactcatcc 


atgatgatgg 


4500 


cgatgggccc 


gtgggcggcg 


gcttgggcaa 


agacgtttcg 


ggggtcggac 


acatcgtagt 


4560 


tgtggtcctg 


ggtgagctcg 


tcataggcca 


ttttaatgaa 


tttggggcgg 


agggtgcccg 


4620 


actgggggac 


gaaggtgccc 


tcgatcccgg 


gggcgtagtt 


gccctcgcag 


atctgcatct 


4680 


cccaggcctt 


gagctcggag 


ggggggatca 


tgtccacctg 


cggggcgatg 


aaaaaaacgg 


4740 


tttccggggc 


gggggagatg 


agctgggccg 


aaagcaggtt 


ccggagcagc 


tgggacttgc 


4800 


cgcagccggt 


ggggccgtag 


atgaccccga 


tgaccggctg 


caggtggtag 


ttgagggaga 


4860 


gacagctgcc 


gtcctcgcgg 


aggagggggg 


ccacctcgtt 


catcatctcg 


cgcacatgca 


4920 


tgttctcgcg 


cacgagttcc 


gccaggaggc 


gctcgccccc 


aagcgagagg 


> 

agctcttgca 


4980 


gcgaggcgaa 


gtttttcagc 


ggcttgagcc 


cgtcggccat 


gggcattttg 


gagagggtct 


5040 


gttgcaagag 


ttccagacgg 


tcccagagct 


cggtgatgtg 


ctctagggca 


tctcgatcca 


5100 


gcagacctcc 


tcgtttcgcg 


ggttggggcg 


actgcgggag 


tagggcacca 


ggcgatgggc 


5160 


gtccagcgag 


gccagggtcc 


ggtccttcca 


ggggcgcagg 


gtccgcgtca 


gcgtggtctc 


5220 


cgtcacggtg 


aaggggtgcg 


cgccgggctg 


ggcgcttgcg 


agggtgcgct 


tcaggctcat 


5280 


ccggctggtc 


gagaaccgct 


cccggtcggc 


gccctgcgcg 


tcggccaggt 


agcaattgag 


5340 


catgagttcg 


tagttgagcg 


cctcggccgc 


gtggcccttg 


gcgcggagct 


tacctttgga 


5400 


agtgtgtccg 


cagacgggac 


agaggaggga 


cttgagggcg 


tagagcttgg 


gggcgaggaa 


5460 


gacggactcg 


ggggcgtagg 


cgtccgcgcc 


gcagctggcg 


cagacggtct 


cgcactccac 


5520 
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cracrccaercrtcr 

y *-*y "«wu y 3 


acrcrt ctcrcrcc 


erer t ccr crcr crt c 

MM u y 3 3 3 


a a a a a ccracrcr 

a a u a a v** y y u 


tttcctccgt 


actttttaat 


5580 


gcgtttctta 


cct ctggtct 


ccatcracrctc 

Xp* V-fLl X^ XJ XA XA X^ V* 


crtcrtcccccrc 


tgggtgacaa 


agaggctgtc 


5640 


cgtgtccccg 


tagaccgact 


ttatcrcfcrcccr 

w X- a X*> \J \J \J X^> X^ XJ 


crt cctccracrc 


ggggtgccgc 


ggt cct cgtc 


5700 


qtaaacrcfaac 


cccgcccact 


ccgagacgaa 


ggcccgggtc 


caaciccacrca 


caaaacraaac 

^yuuyyuyy^ 


5760 


caccrtQcraacr 

3 ^3 3 3 **3 


gggtagcggt 


cgttgtccac 


cagcgggtcc 


accttctcca 


gggtatgcaa 


5820 


gcacatgtcc 


ccctcgtcca 


catccaggaa 


cr cr t cr a 1 1 crcr c 


ttgtaagtgt 


acrcrccaccrtcr 


5880 


acccrercrcrcrt c 


c ccr crcc crcr crcr 

W» X^rXJ XJ X»* Xj X* W XJ 


cr crcrta taaa a 

y y y mo. oaa a a 


crcrcrcrcrccrcrcrci 


ccctcrct cert 


cct ca ctertc 


5940 


ttcccrcrat ccr 


ctcrt cca erer a 


crccr cca ere ter 

y My ^*^C*U \~ y 


tt crcrcrcrta crcr 

LV -y yyy uca yy 


1~ a t*teect ct 


c er a a erer c er er er 
• y o a y y y y y 


6000 


catcracctccr 


crcact cacrcrt 

y v** ca v— » m vh y y \— 


tcrt ca ertttc 


hacraaarrfafi 

V— ci y add Vw» y 3 


era crcra ttter a 


t a 1" t" era cerert* 


6060 


crccattaaaa 


acgcctttca 


t gagcccctc 


gtccatctgg 


tcagaaaaga 


cgatcttttt 


6120 


attatcaacfc 


ttggtggcga 


aggagcegta 


cr a cr cr cr c cr 1 1 cr 


craoacrcacrct 


tcrcrccratcrcra 

w M U W4. w \J U 


6180 


acacat crate 


taattctttt 


cc t tcrt ccr crc 

X** X^ X— X* XJJ w XJ XJ X^ 


acoctcctto 

^ V-f V** W ^# V— \J 


crccrcrcCTatcrt 

y ^ — y y > — y o. v~y \~ 


tcracrctcrcac 

w y a. y w v^y 


6240 


atactcacac 

XJ W U X-* V* X^ U »w XJ 


crccaccrcact 


tccatt ccr crcr 

m 1 — ■ to. i— ^3 3 ™3 


era a era ccrcrt cr 


cr t era cr r 1 1 er t 


ccrcrerca cera t 

333 3 


6300 


tctgacccgc 


cagccgcggt 


tgtgcagggt 


gatgaggtcc 


a cgctggtgg 


CCaCC^LCCJCC 

t4 X^ L- ■ U 


6360 


ercacacracfcr c 

3 W 3 w "3 3 33 ^* 


tcgttggtcc 


a cr c a q a cr cr c cr 


cccgcccttg 


ccrccraacacra 

V4 y W*. LA \A V-Ai 


acrcrcrcracrcaCT 


6420 


cqaatccaac 

333 ^ ww * 3 


atgagctcgt 


caaaaacrcrtc 

^3333333 


ggcgtccacg 


atcraaaatac 


caaacaaaacr 

X^ V4 *H *3 X^ W4 \J W U SJ 


6480 


ctccraacrtca 

wwv *3333 ^^3 


aagtagctga 


tcrcaacrtacc 


cacrat cert cc 


a acaccerc tt 


CTccacrtcaccr 

y u y w w y w y 


6540 


cacggccagc 


gcgcgctcgt 


a cr cr cr cr c t cr a cr 

y y ^ y ' w u u. u 


crcr o c crt o c cc 

^4 y ^ y v y 


ca crdfTca t rtfr 

V— * u. u U ^»CL \J \J 


erert erccrt era er 

mm uu y v- y cl y 


6600 


ccrcercra crcr ccr 


tacatcrcccrc 

X«» W4> X*' S-4 ^ ^ X-* ^ X^ 


acra tcrt cert a 


era cert a era crcr 


crerct cct cera 


erer a c ere cera t 


6660 


ataaataaaa 

3 wu 3 3 ^3333 


tagcagcgcc 


ccccgcggat 


gctggcgcgc 


aegtagtegt 


acagct cgtg 


6720 


caaaaacaccr 

3 3 3 3 3 w 3 


aacraacccaa 

U 33 M 3 3 3 


taccaaaatt 

3^^3^33 ^ 


aaaacactac 


acfcttttcacr 


cacaataaac 

^y ^y y m & y a. m 


6780 


qatctaacaa 

3 33 33 


aagatggcgt 


aaaaattaaa 

333 3 3 3 


aaaaataata 

33 U 3 M 3 3 3 


aacctctaaa 

yy<w*_v-v-i-yyt-(. 


agatgttgaa 


6840 


qtaacfccrtcrcf 

3 *~ 3 3 3 ^3 *~ 3 3 


ggcagtccga 


ccgagtccct 


gatgaagtgg 


gcgtaggagt 


cc tgeagett 


6900 


cfcrccracaacrc 

3 3 w 3 ^3 " 3 


tcaacaataa 


cgagga cgtc 


cagggegcag 


tagtcgaggg 


tctcttggat 


6960 


gatgtegtae 


ttcracrctcrcf c 

W X* ^ X-4 *^ X^ X* XJ ^-j 


c c 1 1 c t cr c 1 1 

w V— V- V#* V#» V» M V* ^» ^» 


ccacacrctccr 


ccrcrttcf a era a 


ereraactctt c 

y y gl u m m w w m 


7020 


gcggtccttc 


cagtactctt 


cgagggggaa 


cccgtcctga 


teggcaeggt 


aagagcccac 


7080 


catgtagaac 


tggttgacgg 


ccttgtaggc 


gcagcagccc 


ttctccacgg 


ggagggcgta 


7140 


agettgegeg 


gccttgcgca 


gggaggtgtg 


ggtgagggcg 


aaggtgtcgc 


gcaccatgac 


7200 


cttgaggaac 


tggtgcttga 


agtcgaggtc 


gtcgcagccg 


ccctgctccc 


agagctggaa 


7260 
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gtccgtgcgc ttcttgtagg cggggttggg caaagcgaaa gtaacatcgt tgaagaggat 7320 

cttgcccgcg cggggcatga agttgcgagt gatgcggaaa ggctggggca cctcggcccg 7380 

gttgttgatg acctgggcgg cgaggacgat ctcgtcgaag ccgttgatgt tgtgcccgac 7440 

gatgtagagt tccacgaatc gcgggcggcc cttgacgtgg ggcagcttct tgagctcgtc 7500 

gtaggtgagc tcggcggggt cgctgaggcc gtgctgctcg agggcccagt cggcgaggtg 7560 

ggggttggcg ccgaggaagg aagtccagag atccacggcc agggcggtct gcaagcggtc 7620 

ccggtactga cggaactgct ggcccacggc cattttttcg ggggtgacgc agtagaaggt 7680 

gcgggggtcg ccgtgccagc ggtcccactt gagctggagg gcgaggtcgt gggcgagctc 7740 

gacgagcggc gggtccccgg agagtttcat gaccagcatg aaggggacga gctgcttgcc 78 00 

gaaggacccc atccaggtgt aggtttccac gtcgtaggtg aggaagagcc tttcggtgcg 7860 
aggatgcgag ccgatgggga agaactggat ctcctgccac cagttggagg aatggctgtt ' 7920 

gatgtgatgg aagtagaaat gccgacggcg cgccgagcac tcgtgcttgt gtttatacaa 7980 

gcgtccgcag tgctcgcaac gctgcacggg atgcacgtgc tgcacgagct gtacctgggt 8040 

tcctttgacg aggaatttca gtgggcagtg gagcgctggc ggctgcatct ggtgctgtac 8100 

tacgtcctgg ccatcggcgt ggccatcgtc tgcctcgatg gtggtcatgc tgacgaggcc 8160 

gcgcgggagg caggtccaga cctcggctcg gacgggtcgg agagcgagga cgagggcgcg 8220 

caggccggag ctgtccaggg tcctgagacg ctgcggagtc aggtcagtgg gcagcggcgg 8280 

cgcgcggttg acttgcagga gcttttccag ggcgcgcggg aggtccagat ggtacttgat 8340 

ctccacggcg ccgttggtgg cgacgtccac ggcttgcagg gtcccgtgcc cctggggcgc 8400 

caccaccgtg ccccgtttct tcttgggtgc tggcggcggc ggctccatgc ttagaagcgg 8460 

cggcgaggac gcgcgccggg cggcaggggc ggctcggggc ccggaggcag gggcggcagg 8520 

ggcacgtcgg cgccgcgcgc gggcaggttc tggtactgcg cccggagaag actggcgtga 8580 

gcgacgacgc gacggttgac gtcctggatc tgacgcctct gggtgaaggc cacgggaccc 8640 

gtgagtttga acctgaaaga gagttcgaca gaatcaatct cggtatcgtt gacggcggcc 8700 

tgccgcagga tctcttgcac gtcgcccgag ttgtcctggt aggcgatctc ggtcatgaac 8760 

tgctcgatct cctcctcctg aaggtctccg cgaccggcgc gctcgacggt ggccgcgagg 8820 

tcgttggaga tgcggcccat gagctgcgag aaggcgttca tgccggcctc gttccagacg 888 0 

cggctgtaga ccacggctcc gtcggggtcg cgcgcgcgca tgaccacctg ggcgaggttg 8940 
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agctcgacgt ggcgcgtgaa 


gaccgcgtag 


ttgcagaggc 


gctggtagag 

3 3 3 3 3 


gtagttgagc 

3 3 3 3 


9000 


gtggtggcga tgtgctcggt 


gacgaagaag 


tacatgatcc 


agcggcggag 


cggcatctcg 


9060 


ctgacgtcgc ccagggcttc 


caagcgctcc 


atggcctcgt 


agaagtccac 


ggcqaagttg 

33 3 3 J 


9120 


aaaaactggg agttgcgcgc 


cgagacggtc 


aactcctcct 


ccagaagacg 


gatgagctcg 

3 3 3 3 


9180 


gcgatggtgg cgcgcacctc 


gcgctcgaag 


gccccggggg 


gctcctcttc 


ttccatctcc 


9240 


tcctcctctt ccatctcctc 


cactaacatc 


tcttctactt 


cctcctcagg 


aggeggegge 

33 33 33 


9300 


gggggagggg ccctgcgtcg 


ccggcggcgc 


acgggcagac 


ggtcgatgaa 

3 3 3 3 


Qcqctcqatq 


9360 


gtctccccgc gccggcgacg 


catggtctcg 


ataacqacac 

3 3 3 ^? 3 


gcccgtcctc 


acaacrcrcccfc 


9420 


agcgtgaaga cgccgccgcg 


catctccagg 


tggccgcccrq 

3 3 ~~ 3 33 


qqqqqtctcc 


attaacrcaaa 


9480 


gagagggcgc tgacgatgca 


tcttatcaat 


tggcccgtag 


gqactccaca 

3 3 3 3 


caaggacctg 


9540 


agcgtctcga gatccacggg 


atccgaaaac 


cgctgaacga 


aggcttcgag 

3 3 3 3 


ccagtcgcag 


9600 


tcgcaaggta ggctgagccc 


ggtttcttgt 


tcttcgggta 


tttggtcggg 


aggegggegg 

3 3 3 3 3 3 3 


9660 


gcgatgctgc tggtgatgaa 


gttgaagtag 


gcggtcctga 


gaeggeggat 

3 J 3 3 3 


ggtggcgaqq 

3 3 3 3 3 3 3 


9720 


agcaccaggt ccttgggccc 


ggcttgctgg 

-3 3 3 3 3 


atgcgcagac 


gqtcqqecat 

3 3 3 3 ^ w ^ «■» 


gccccaggcg 


9780 


tggtcctgac acctggcgag 


gtccttgtag 


tagtcctgca 


tgagccgctc 


cacgggcacc 


9840 


tcctcctcgc ccgcgcggcc 


gtgcatgcgc 


gtgagcccga 


acccgcgctg 


caactaaaca 


9900 


agcgccaggt cggcgacgac 


gcgctcggcg 


aggatggcct 


gctggatctg 

3 3 3 3 


qqtqaqqqtq 

33 3333 3 


9960 


gtctggaagt cgtcgaagtc 


gacgaagcgg 


tggtaggctc 


cggtgttgat 


ggtgtaggag 

3 3 3 3 3 3 


10020 


cagttggcca tgacggacca 


gttgacggtc 


tggtggccgg 


ggegcacgag 


ctcgtggtac 


10080 


ttgaggcgcg agtaggcgcg 


cgtgtcgaag 


atgtagtcgt 


tgcaggtgcg 

3 ~ 3 3 3 


cacgaggtac 


10140 


tggtatccga cgaggaagtg 


cggcggcggc 


tggcggtaga 


gcggccatcg 

3 3 3 3 


ctcggtggcg 

3 3 3 3 3 


10200 


ggggcgccgg gcgcgaggtc 


ctcgagcatg 


aggcggtggt 


ageegtagat 


gtacctggac 


10260 


atccaggtga tgccggcggc 


ggtggtggag 


gcgcgcggga 


actcgcggac 


qcqqttccaq 

3 3 3 3 


10320 


atgttgcgca gcggcaggaa 


gtagttcatg 


gtggccgcgg 


tctggcccgt 


qaqqcqcqcq 

3 3 3 w 3 3 3 


10380 


cagtcgtgga tgctctagac 


atacgggcaa 


aaacgaaagc 


gqtcaqcqqc 


tcgactccgt 


10440 


ggcctggagg ctaagcgaac 


gggttgggct 


gcgcgtgtac 


cccggttcga 


gtccctgctc 


10500 


gaatcaggct ggagccgcag 


ctaacgtggt 


actggcactc 


ccgtctcgac 


ccaagcctgc 


10560 


taacgaaacc tccaggatac 


ggaggcgggt 


cgttttggcc 


attttegtea 


ggccggaaat 


10620 


gaaactagta agcgcggaaa 


gcggccgtcc 


gcgatggctc 


getgeegtag 


tctggagaaa 


10680 
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gaatcgccag 


ggttgcgttg 


cggtgtgccc 


cggttcgagc 


ctcagcgctc 


ggcgccggcc 


10740 


ggattccgcg 


gctaacgtgg 


gcgtggctgc 


cccgtcgttt 


ccaagacccc 


ttagccagcc 


10800 


gacttctcca 


gttacggagc 


gagcccctct 


ttttcttgtg 


tttttgccag 


atgcatcccg 


10860 


tactgcggca 


gatgcgcccc 


caccctccac 


cacaaccgcc 


cctaccgcag 


cagcagcaac 


10920 


agccggcgct 


tctgcccccg 


ccccagcagc 


agcagccagc 


cactaccgcg 


gcggccgccg 


10980 


tgagcggagc 


cggcgttcag 


tatgacctgg 


ccttggaaga 


gggcgagggg 


ctggcgcggc 


11040 


tgggggcgtc 


.gtcgccggag 


cggcacccgc 


gcgtgcagat 


gaaaagggac 


gctcgcgagg 


11100 


cctacgtgcc 


caagcagaac 


ctgttcagag 


acaggagcgg 


cgaggagccc 


gaggagatgc 


11160 


gcgcctcccg 


cttccacgcg 


gggcgggagc 


tgcggcgcgg 


cctggaccga 


aagcgggtgc 


11220 


tgagggacga 


ggatttcgag 


gcggacgagc 


tgacggggat 


cagccccgcg 


cgcgcgcacg 


11280 


tggccgcggc 


caacctggtc 


acggcgtacg 


agcagaccgt 


gaaggaggag 


agcaacttcc 


11340 


aaaaatcctt 


caacaaccac 


gtgcgcacgc 


tgatcgcgcg 


cgaggaggtg 


accctgggcc 


11400 


tgatgcacct 


gtgggacctg 


ctggaggcca 


tcgtgcagaa 


ccccacgagc 


aagccgctga 


11460 


cggcgcagct 


gtttctggtg 


gtgcagcaca 


gtcgggacaa 


cgagacgttc 


agggaggcgc 


11520 


tgctgaatat 


caccgagccc 


gagggccgct 


ggctcctgga 


cctggtgaac 


attctgcaga 


11580 


gcatcgtggt 


gcaggagcgc 


gggctgccgc 


tgtccgagaa 


gctggcggcc 


atcaacttct 


11640 


cggtgctgag 


cctgggcaag 


tactacgcta 


ggaagatcta 


caagaccccg 


tacgtgccca 


11700 


tagacaagga 


ggtgaagatc 


gacgggtttt 


acatgcgcat 


gaccctgaaa 


gtgctgaccc 


11760 


tgagcgacga 


tctgggggtg 


taccgcaacg 


acaggatgca 


ccgcgcggtg 


agcgccagcc 


11820 


gccggcgcga 


gctgagcgac 


caggagctga 


tgcacagcct 


gcagcgggcc 


ctgaccgggg 


11880 


ccgggaccga 


gggggagagc 


tactttgaca 


tgggcgcgga 


cctgcgctgg 


cagcctagcc 


11940 


gccgggcctt 


ggaagctgcc 


ggcggttccc 


cctacgtgga 


ggaggtggac 


gatgaggagg 


12000 


aggagggcga 


gtacctggaa 


gactgatggc 


gcgaccgtat 


ttttgctaga 


tgcagcaaca 


12060 


gccaccgccg 


cctcctgatc 


ccgcgatgcg 


ggcggcgctg 


cagagccagc 


cgtccggcat 


12120 


taactcctcg 


gacgattgga 


cccaggccat 


gcaacgcatc 


atggcgctga 


cgacccgcaa 


12180 


tcccgaagcc 


tttagacagc 


agcctcaggc 


caaccgactc 


tcggccatcc 


tggaggccgt 


12240 


ggtgccctcg 


cgctcgaacc 


ccacgcacga 


gaaggtgctg 


gccatcgtga 


acgcgctggt 


12300 


ggagaacaag 


gccatccgcg 


gcgacgaggc 


cgggctggtg 


tacaacgcgc 


tgctggagcg 


12360 
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cgtggcccgc 


tacaacagca 


ccaacgtgca 


gacgaacctg 


qaccgcatqq 

3 3 3 3 


tgaccgacgt 


12420 


gcgcgaggcg 


gtgtcgcagc 


gcgagcggtt 

3^3~3" - ~33 


ccaccgcgag 


tcgaacctgg 


gctccatggt 


12480 


ggcgctgaac 


gccttcctga 


gcacgcagcc 


cgccaacgtg 


ccccqqaqcc 


aqqaqqacta 

" > 33"-33 t *^"^" aL 


12540 


caccaacttc 


atcagcgcgc 


tgcggctgat 

3 3 3 3 


gqtqqccqaq 

3 3 3 3 3 3 


gtgccccaga 


qcqaqqtqta 

3 3 3 3 3 w * w *' 


12600 


ccagtcgggg 


ccggactact 


tcttccagac 


cagtcgccag 


ggcttgcaga 

3 3 3 3 


ccgtgaacct 


12660 


gagccaggct 


ttcaagaact 


tgcagggact 


gtggggcgtg 


caggccccgg 


tcggggaccg 


12720 


cgcgacggtg 


tcgagcctgc 


tgacgccgaa 


ctcgcgcctg 


ctgctgctgc 


tqqtqqcqcc 

3 3 3 3 3 


12780 


cttcacggac 


agcggcagcq 


tqaqccqcqa 

3 3 w ^ 3 3 


ctcgtacctg 


ggctacctgc 


ttaacctgta 


12840 


ccgcgaqqcc 

3 3 f J 


atcqaqcaaa 


cacacataaa 

3 ^3 ^ 3 3 " 


cgagcagacc 


taccaggaga 


tcacccacat 


12900 


gagccgcgcg 

3 3 3 3 3 


ctqqqccaqa 


aggacccggg 


caacc tggag 


gccaccctga 


acttcctcrct 


12960 


gaccaaccgg 


tcgcagaaga 


tcccgcccca 


gtacgcgctg 

3 3 3 3 


aqcaccqaqq 

3 3 33 


aqqaqcqcat 

3 3 3 3 


13020 


cctgcgctac 


gtgcagcaga 


gcgtggggct 


gttcctgatg 


caggaggggg 

3 3 3 3 3 3 3 


ccacgcccag 


13080 


cgccgcgctc 


gacatgaccg 


cgcgcaacat 


ggagcccagc 


atgtacgccc 


gcaaccgccc 


13140 


gttcatcaat 


aagctgatgg 

3 3 3 3 


actacttgca 


tcqgqcqqcc 

333 33 


gccatgaact 


cggactactt 


13200 


taccaacgcc 


at cttgaacc 


cgcactggct 


cccgccgccc 


cracrtt ctaca 

\A \A I* w ^* w Wl 


ccrcrcfccracrta 

3 3 3 3 3 


13260 


cgacatgccc 


gaccccaacg 


acgggttcct 


qtqqqacqac 

3 3 3 3 3 ** w 


ataaacaaca 


gcgtgttctc 


13320 


gccgcgcccc 


accaccacca 


ccgtgtggaa 

3 3 3 3 


qaaaqaqqqc 

3 3 333 


cfggaaccaac 

3 3 3 3 3 3 *"* 


ggccgtcctc 


13380 


ggcgctgtcc 


ggtcgcgcgg 


gtgctgccgc 


ggcggtgccc 


gaggccgcca 


gccccttccc 


13440 


gagcctgccc 


ttttcgctga 


acagcgtgcg 


cagcagcgag 


ctgggtcggc 


tgacgcggcc 


13500 


gcgcctgctg 


ggcgaggagg 


agtacctgaa 


cgactccttg 


cttcggcccg 


agcgcgagaa 


13560 


gaacttcccc 


aataacggga 


tagagagcct 


ggtggacaag 

3 3 3 3 3 


atgagccgct 


ggaagacgta 


13620 


cgcgcacgag 


cacagggacg 


agccccgagc 


tagcagcagc 


accggcgcca 


cccgtagacg 


13680 


ccagcggcac 


gacaggcagc 


ggggtctggt 


gtgggacgat 


gaggattccg 


ccgacgacag 


13740 


cagcgtgttg 


gacttgggtg 


ggagtggtgg 


tggtaacccg 


ttcgctcacc 


tgcgcccccg 


13800 


tatcgggcgc 


ctgatgtaag 


aatctgaaaa 


aataaaagac 


ggtactcacc 


aaggccatgg 


13860 


cgaccagcgt 


gcgttcttct 


ctgttgtttg 


tagtagt atg atg agg cgc gtg tac 


13915 



Met Met Arg Arg Val Tyr 
1 5 



ccg gag ggt cct cct ccc teg tac gag age gtg atg cag cag gcg gtg 13963 
Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser Val Met Gin Gin Ala Val 
10 15 20 
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gcg gcg gcg atg cag ccc ccg ctg gag gcg cct tac gtg ccc ccg egg 14011 
Ala Ala Ala Met Gin Pro Pro Leu Glu Ala Pro Tyr Val Pro Pro Arg 
25 30 " 35 

tac ctg gcg cct acg gag ggg egg aac age att cgt tac teg gag ctg 14059 
Tyr Leu Ala Pro Thr Glu Gly Arg Asn Ser lie Arg Tyr Ser Glu Leu 
40 45 50 

gca ccc ttg tac gat acc acc egg ttg tac ctg gtg gac aac aag teg 14107 
Ala Pro Leu Tyr Asp Thr Thr Arg Leu Tyr Leu Val Asp Asn Lys Ser 
55 60 ~ 65 " 70 

gcg gac ate gee teg ctg aac tac cag aac gac cac age aac ttc ctg 14155 
Ala Asp lie Ala Ser Leu Asn Tyr Gin Asn Asp His Ser Asn Phe Leu 
75 80 85 

acc acc gtg gtg cag aac aac gat ttc acc ccc acg gag gee age acc 14203 
Thr Thr Val Val Gin Asn Asn Asp Phe Thr Pro Thr Glu Ala Ser Thr 
90 95 100 

cag acc ate aac ttt gac gag cgc teg egg tgg ggc ggc cag ctg aaa 14251 
Gin Thr He Asn Phe Asp Glu Arg Ser Arg Trp Gly Gly Gin Leu Lys 
105 HO 115 



acc ate atg cac acc aac atg ccc aac gtg aac gag ttc atg tac age 
Thr He Met His Thr Asn Met Pro Asn Val Asn Glu Phe Met Tyr Ser 
120 125 130 



14299 



aac aag ttc aag gcg egg gtg atg gtc teg cgc aag acc ccc aac ggg 14347 
Asn Lys Phe Lys Ala Arg Val Met Val Ser Arg Lys Thr Pro Asn Gly 
135 140 145 150 

gtc aca gta aca gat ggt agt cag gac gag ctg acc tac gag tgg gtg 14395 
Val Thr Val Thr Asp Gly Ser Gin Asp Glu Leu Thr Tyr Glu Trp Val 
155 160 ~ 165 

gag ttt gag ctg ccc gag ggc aac ttc teg gtg acc atg acc ate gat 14443 
Glu Phe Glu Leu Pro Glu Gly Asn Phe Ser Val Thr Met Thr He Asp 
170 175 180 

ctg atg aac aac gee ate ate gac aac tac ttg gcg gtg ggg egg cag 14491 
Leu Met Asn Asn Ala He He Asp Asn Tyr Leu Ala Val Gly Arg Gin 
185 190 195 

aac ggg gtg ctg gag age gac ate ggc gtg aag ttc gac acg cgc aac 14539 
Asn Gly Val Leu Glu Ser Asp He Gly Val Lys Phe Asp Thr Arg Asn 
200 205 210 

ttc egg ctg ggc tgg gac ccc gtg acc gag ctg gtg atg ccg ggc gtg 14587 
Phe Arg Leu Gly Trp Asp Pro Val Thr Glu Leu Val Met Pro Gly Val 
215 220 225 230 

tac acc aac gag gee ttc cac ccc gac ate gtc ctg ctg ccc ggc tgc 14635 
Tyr Thr Asn Glu Ala Phe His Pro Asp He Val Leu Leu Pro Gly Cys 
235 240 245 
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99 c gtg gac ttc acc gag age cgc etc age aac ctg ctg ggc ate cgc 14683 
Gly Val Asp Phe Thr Glu Ser Arg Leu Ser Asn Leu Leu Gly lie Arg 
250 255 260 

aag egg cag ccc ttc cag gag ggc ttc cag ate ctg tac gag gac ctg 14731 
Lys Arg Gin Pro Phe Gin Glu Gly Phe Gin lie Leu Tyr Glu Asp Leu 
265 270 275 

gag ggg ggc aac ate ccc gcg ctg ctg gac gtg gac gec tac gag aaa 14779 
Glu Gly Gly Asn lie Pro Ala Leu Leu Asp Val Asp Ala Tyr Glu Lys 
280 285 290 

age aag gag gat age gec gec gcg gcg acc gca gec gtg gee acc gec 14827 
Ser Lys Glu Asp Ser Ala Ala Ala Ala Thr Ala Ala Val Ala Thr Ala 
295 300 305 310 

tct acc gag gtg egg ggc gat aat ttt get age gee gcg aca ctg gca 14875 
Ser Thr Glu Val Arg Gly Asp Asn Phe Ala Ser Ala Ala Thr Leu Ala 
315 320 325 

gcg gec gag gcg get gaa acc gaa agt aag ata gtg ate cag ccg gtg 14923 
Ala Ala Glu Ala Ala Glu Thr Glu Ser Lys He Val He Gin Pro Val 
330 335 340 

gag aag gac age aag gag agg age tac aac gtg etc gcg gac aag aaa 14971 
Glu Lys Asp Ser Lys Glu Arg Ser Tyr Asn Val Leu Ala Asp Lys Lys 
345 350 355 

aac acc gec tac cgc age tgg tac ctg gee tac aac tac ggc gac ccc 15019 
Asn Thr Ala Tyr Arg Ser Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro 
360 365 370 

gag aag ggc gtg cgc tec tgg acg ctg etc acc acc teg gac gtc acc 15067 
Glu Lys Gly Val Arg Ser Trp Thr Leu Leu Thr Thr Ser Asp Val Thr 
375 380 385 * 390 

tgc ggc gtg gag caa gtc tac tgg teg ctg ccc gac atg atg caa gac 15115 
Cys Gly Val Glu Gin Val Tyr Trp Ser Leu Pro Asp Met Met Gin Asp 
395 400 405 

ccg gtc acc ttc cgc tec acg cgt caa gtt age aac tac ccg gtg gtg 15163 
Pro Val Thr Phe Arg Ser Thr Arg Gin Val Ser Asn Tyr Pro Val Val 
410 415 420 

ggc gec gag etc ctg ccc gtc tac tec aag age ttc ttc aac gag cag 15211 
Gly Ala Glu Leu Leu Pro Val Tyr Ser Lys Ser Phe Phe Asn Glu Gin 
425 430 435 

gee gtc tac teg cag cag ctg cgc gee ttc acc teg etc acg cac gtc 15259 
Ala Val Tyr Ser Gin Gin Leu Arg Ala Phe Thr Ser Leu Thr His Val 
440 445 450 

ttc aac cgc ttc ccc gag aac cag ate etc gtt cgc ccg ccc gcg ccc 15307 
Phe Asn Arg Phe Pro Glu Asn Gin He Leu Val Arg Pro Pro Ala Pro 
455 460 465 ' 470 
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acc att acc acc gtc agt gaa aac gtt cct get etc aca gat cac ggg 15355 

Thr lie Thr Thr Val Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly 

475 480 485 

acc ctg ccg ctg cgc age agt ate egg gga gtc cag cgc gtg acc gtc 15403 

Thr Leu Pro Leu Arg Ser Ser lie Arg Gly Val Gin Arg Val Thr Val 

490 495 500 

act gac gee aga cgc cgc acc tgc ccc tac gtc tac aag gee ctg ggc 15451 

Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly 

505 510 515 



gta gtc gcg ccg cgc gtc etc teg age cgc acc ttc taa aaaatgteea 15500 
Val Val Ala Pro Arg Val Leu Ser Ser Arg Thr Phe 



520 




525 




530 






ttctcatctc 


geccagtaat 


aacaceggtt 


ggggcctgcg 


cgcgcccagc 


aagatgtacg 


15560 


gaggegcteg 


ccaacgctcc 


acgcaacacc 


ccgtgcgcgt 


gcgcgggcac 


ttccgcgctc 


15620 


cctggggcgc 


cctcaagggc 


cgcgtgcgct 


cgcgcaccac 


cgtcgacgac 


gtgatcgacc 


15680 


aggtggtggc 


cgacgcgcgc 


aactacacgc 


ccgccgccgc 


gcccgtctcc 


accgtggacg 


15740 


ccgtcatcga 


cagcgtggtg 


gccgacgcgc 


gccggtacgc 


ccgcgccaag 


agccggcggc 


15800 


ggcgcatcgc 


ccggcggcac 


cggagcaccc 


ccgccatgcg 


cgcggcgcga 


gccttgctgc 


15860 


geagggecag 


gcgcacggga 


cgcagggcca 


tgetcaggge 


ggccagacgc 


gcggcctccg 


15920 


gcagcagcag 


cgccggcagg 


acccgcagac 


gcgcggccac 


ggcggcggcg 


gcggccatcg 


15980 


ccagcatgtc 


ccgcccgcgg 


cgcggcaacg 


tgtactgggt 


gcgcgacgcc 


gccaccggtg 


16040 


tgcgcgtgcc 


cgtgcgcacc 


cgcccccctc 


gcacttgaag 


atgetgaett 


cgcgatgttg 


16100 


atgtgtccca 


gcggcgagga 


ggatgtccaa 


gegcaaatte 


aaggaagaga 


tgctccaggt 


loloO 


catcgcgcct' 


gagatctacg 


gcccggcggc 


ggtgaaggag 


gaaagaaagc 


cccgcaaact 


16220 


gaagcgggtc 


aaaaaggaca 


aaaaggagga 


ggaagatgtg 


gaeggactgg 


tggagtttgt 


16280 


gcgcgagttc 


gccccccggc 


ggcgcgtgca 


gtggcgcggg 


cggaaagtga 


aaceggtget 


16340 


gcgacccggc 


accacggtgg 


tcttcacgcc 


eggegagegt 


tccggctccg 


cctccaagcg 


16400 


ctcctacgac 


gaggtgtacg 


gggacgagga 


catcctcgag 


caggcggccg 


aacgtctggg 


16460 


egagtttget 


taeggcaage 


gcagccgccc 


cgcgcccttg 


aaagaggagg 


cggtgtccat 


16520 


cccgctggac 


cacggcaacc 


ccacgccgag 


cctgaagccg 


gtgaccctgc 


ageaggtget 


16580 


gcctggtgcg 


gcgccgcgcc 


ggggcttcaa 


gegegaggge 


ggegaggate 


tgtacccgac 


16640 


catgeagctg 


atggtgccca 


agegecagaa 


gctggaggac 


gtgctggagc 


acatgaaggt 


16700 


ggaccccgag 


gtgcagcccg 


aggtcaaggt 


gcggcccatc 


aagcaggtgg 


ccccgggcct 


16760 
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gggcgtgcag 


accgtggaca 


tcaagatccc 


cacggagccc atggaaacgc 


agaccgagcc 


16820 


cgtgaagccc 


agcaccagca 


ccatggaggt 


gcagacggat ccctggatgc 


cggcaccggc 


16880 


ttccaccacc 


cgccgaagac 


gcaagtacgg 


cgcggccagc ctgctgatgc 


ccaactacgc 


16940 


gctgcatcct 


tccatcatcc 


ccacgccggg 


ctaccgcggc acgcgcttct 


accgcggcta 


17000 


caccagcagc 


cgccgccgca 


agaccaccac 


ccgccgccgc cgtcgtcgca 


cccgccgcag 


17060 


cagcaccgcg 


acttccgccg 


ccgccctggt 


gcggagagtg taccgcagcg 


ggcgcgagcc 


17120 


tctgaccctg 


ccgcgcgcgc 


gctaccaccc 


gagcatcgcc atttaactac 


cgcctcctac 


17180 


ttgcagatat 


ggccctcaca 


tgccgcctcc 


gcgtccccat tacgggctac 


cgaggaagaa 


17240 


agccgcgccg 


tagaaggctg 


acggggaacg 


ggctgcgtcg ccatcaccac 


cggcggcggc 


17300 


gcgccatcag 


caagcggttg 


gggggaggct 


tcctgcccgc gctgatgccc 


atcatcgccg 


17360 


cggcgatcgg 


ggcgatcccc 


ggcatagctt 


ccgtggcggt gcaggcctct 


cagcgccact 


17420 


gagacacagc 


ttggaaaatt 


tgtaataaaa 


aatggactga cgctcctggt 


cctgtgatgt 


17480 


gtgtttttag 


atggaagaca 


tcaatttttc 


gtccctggca ccgcgacacg 


gcacgcggcc 


17540 


gtttatgggc 


acctggagcg 


acatcggcaa 


cagccaactg aacgggggcg 


ccttcaattg 


17600 


gagcagtctc 


tggagcgggc 


ttaagaattt 


cgggtccacg ctcaaaacct 


atggcaacaa 


17660 


ggcgtggaac 


agcagcacag 


ggcaggcgct 


gagggaaaag ctgaaagagc 


agaacttcca 


17720 


gcagaaggtg 


gtcgatggcc 


tggcctcggg 


catcaacggg gtggtggacc 


tggccaacca 


17780 


ggccgtgcag 


aaacagatca 


acagccgcct 


ggacgcggtc ccgcccgcgg 


ggtccgtgga 


17840 


gatgccccag 


gtggaggagg 


agctgcctcc 


cctggacaag cgcggcgaca 


agcgaccgcg 


17900 


tcccgacgcg 


gaggagacgc 


tgctgacgca 


cacggacgag ccgcccccgt 


acgaggaggc 


17960 


ggtgaaactg 


ggtctgccca 


ccacgcggcc 


cgtggcgcct ctggccaccg 


gggtgctgaa 


18020 


acccagcagc 


agcagcagcc 


agcccgcgac 


cctggacttg cctccgcctg 


cttcccgccc 


18080 


ctccacagtg 


gctaagcccc 


tgccgccggt 


ggccgtcgcg tcgcgcgccc 


cccgaggccg 


18140 


cccccaggcg 


aactggcaga 


gcactctgaa 


cagcatcgtg ggtctgggag 


tgcagagtgt 


18200 


gaagcgccgc 


cgctgctatt 


aaaagacact 


gtagcgctta acttgcttgt 


ctgtgtgtat 


18260 


atgtatgtcc 


gccgaccaga 


aggaggagga 


agaggcgcgt cgccgagttg 


caag atg 


18317 



Met 
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gcc acc cca teg atg ctg ccc cag tgg gcg tac atg cac ate gec gga 18365 
Ala Thr Pro Ser Met Leu Pro Gin Trp Ala Tyr Met His lie Ala Gly 
535 540 545 

cag gac get teg gag tac ctg agt ccg ggt ctg gtg cag ttc gcc cgc 18413 
Gin Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gin Phe Ala Arg 
550 555 560 

gcc aca gac acc tac ttc agt ctg ggg aac aag ttt agg aac ccc acg 18461 
Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro Thr 
565 570 575 

gtg gcg ccc acg cac gat gtg acc acc gac cgc age cag egg ctg acg 18509 
Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gin Arg Leu Thr 
580 585 590 595 

ctg cgc ttc gtg ccc gtg gac cgc gag gac aac acc tac teg tac aaa 18557 
Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr Lys 
600 605 610 

gtg cgc tac acg ctg gcc gtg ggc gac aac cgc gtg ctg gac atg gcc 18605 
Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met Ala 
615 620 625 

age acc tac ttt gac ate cgc ggc gtg ctg gat egg ggc cct age ttc 18653 
Ser Thr Tyr Phe Asp lie Arg Gly Val Leu Asp Arg Gly Pro Ser Phe 
630 635 640 

aaa ccc tac tec ggc acc get tac aac age ctg get ccc aag gga gcg 18701 
Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly Ala 
645 650 655 

ccc aac act tgc cag tgg aca tat aaa get gat ggt gat act ggt aca 18749 
Pro Asn Thr Cys Gin Trp Thr Tyr Lys Ala Asp Gly Asp Thr Gly Thr 
660 665 670 675 

gaa aaa acc tat aca tat gga aat gcg cct gtg caa ggc att agt att 18797 
Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val Gin Gly lie Ser lie 
680 685 690 

aca aaa gat ggt att caa ctt gga act gac act gat gat cag ccc att 18845 
Thr Lys Asp Gly lie Gin Leu Gly Thr Asp Thr Asp Asp Gin Pro lie 
695 700 * 705 

tat gca gat aaa act tat caa cca gag cct caa gtg ggt gat get gaa 18893 
Tyr Ala Asp Lys Thr Tyr Gin Pro Glu Pro Gin Val Gly Asp Ala Glu 
710 715 720 

tgg cat gac ate act ggt act gat gaa aaa tat gga ggc aga get etc 18941 
Trp His Asp lie Thr Gly Thr Asp Glu Lys Tyr Gly Gly Arg Ala Leu 
725 730 735 

aag cct gac acc aaa atg aag ccc tgc tat ggt tct ttt gcc aag cct 18989 
Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Lys Pro 
740 745 750 755 
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acc aat aaa gaa gga ggt cag gca aat gtg aaa acc gaa aca ggc ggt 19037 
Thr Asn Lys Glu Gly Gly Gin Ala Asn Val Lys Thr Glu Thr Gly Gly 
760 765 770 

acc aaa gaa tat gac att gac atg gca ttc ttc gat aat cga agt gca 19085 
Thr Lys Glu Tyr Asp He Asp Met Ala Phe Phe Asp Asn Arg Ser Ala 
775 780 * 785 

get gcg get ggc ctg gec cca gaa att gtt ttg tat act gag aat gtg 19133 
Ala Ala Ala Gly Leu Ala Pro Glu He Val Leu Tyr Thr Glu Asn Val 
790 795 800 

gat ctg gaa act cca gat act cat att gta tac aag gcg ggc aca gat 19181 
Asp Leu Glu Thr Pro Asp Thr His He Val Tyr Lys Ala Gly Thr Asp 
805 810 815 

gac age age tct tct ate aat ttg ggt cag cag tec atg ccc aac aga 19229 
Asp Ser Ser Ser Ser He Asn Leu Gly Gin Gin Ser Met Pro Asn Arg 
820 825 830 835 

ccc aac tac att ggc ttt aga gac aac ttt ate ggg etc atg tac tac 19277 
Pro Asn Tyr He Gly Phe Arg Asp Asn Phe He Gly Leu Met Tyr Tyr 
840 845 850 

aac age act ggc aac atg ggc gtg ctg get ggt cag gee tec cag ctg 19325 
Asn Ser Thr Gly Asn' Met Gly Val Leu Ala Gly Gin Ala Ser Gin Leu 
855 860 865 

aat get gtg gtg gac ttg cag gac aga aac act gaa ctg tec tac cag 19373 
Asn Ala Val Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin 
870 875 880 

etc ttg ctt gac tct ctg ggc gac aga acc agg tat ttc agt atg tgg 19421 
Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp 
885 890 895 



aat cag gcg gtg gac age tat gac ccc gat gtg cgc att att gaa aat 
Asn Gin Ala Val Asp Ser Tyr Asp Pro Asp Val Arg He He Glu Asn 
900 905 910 " 915 



19469 



cac ggt gtg gag gat gaa etc cct aac tat tgc ttc ccc ctg gat get 19517 
His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp Ala 
920 925 930 

gtg ggt aga act gat act tac cag gga att aag gee aat ggt get gat 19565 
Val Gly Arg Thr Asp Thr Tyr Gin Gly He Lys Ala Asn Gly Ala Asp 
935 940 945 

caa acc acc tgg acc aaa gat gat act gtt aat gat get aat gaa ttg 19613 
Gin Thr Thr Trp Thr Lys Asp Asp Thr Val Asn Asp Ala Asn Glu Leu 
950 955 960 

ggc aag ggc aat cct ttc gee atg gag ate aac ate cag gee aac ctg 19661 
Gly Lys Gly Asn Pro Phe Ala Met Glu He Asn lie Gin Ala Asn Leu 
965 970 975 
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tgg egg aac ttc etc tac gcg aac gtg gcg ctg tac ctg ccc gac tec 19709 
Trp Arg Asn Phe Leu Tyr Ala Asn Val Ala Leu Tyr Leu Pro Asp Ser 
980 985 990 995 

tac aag tac acg ccg gee aac ate acg ctg ccg ace aac ace aac 19754 
Tyr Lys Tyr Thr Pro Ala Asn lie Thr Leu Pro Thr Asn Thr Asn 
1000 1005 1010 

acc tac gat tac atg aac ggc cgc gtg gtg gcg ccc teg ctg gtg 19799 
Thr Tyr Asp Tyr Met Asn Gly Arg Val Val Ala Pro Ser Leu Val 
1015 " 1020 1025 

gac gee tac ate aac ate ggg gcg cgc tgg teg ctg gac ccc atg 19844 
Asp Ala Tyr lie Asn lie Gly Ala Arg Trp Ser Leu Asp Pro Met 
1030 " " 1035 1040 

gac aac gtc aac ccc ttc aac cac cac cgc aac gcg ggc ctg cgc 19889 
Asp Asn Val Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg 
1045 1050 1055 

tac cgc tec atg etc ctg ggc aac ggg cgc tac gtg ccc ttc cac 19934 
Tyr Arg Ser Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His 
1060 1065 1070 

ate cag gtg ccc caa aag ttc ttc gee ate aag age etc ctg etc 19979 
lie Gin Val Pro Gin Lys Phe Phe Ala lie Lys Ser Leu Leu Leu 
1075 " 1080 ~ 1085 

ctg ccc ggg tec tac acc tac gag tgg aac ttc cgc aag gac gtc 20024 
Leu Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val 
1090 1095 1100 

aac atg ate ctg cag age tec etc ggc aac gac ctg cgc acg gac 20069 
Asn Met lie Leu Gin Ser Ser Leu Gly Asn Asp Leu Arg Thr Asp 
1105 1110 1115 

999 9 CC tec ate gee ttc acc age ate aac etc tac gee acc ttc 20114 
Gly Ala Ser lie Ala Phe Thr Ser lie Asn Leu Tyr Ala Thr Phe 
1120 1125 1130 

ttc ccc atg gcg cac aac acc gee tec acg etc gag gee atg ctg 20159 
Phe Pro Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu 
1135 1140 1145 

cgc aac gac acc aac gac cag tec ttc aac gac tac etc teg gcg 20204 
Arg Asn Asp Thr Asn Asp Gin Ser Phe Asn Asp Tyr Leu Ser Ala 
1150 * 1155 1160 

gee aac atg etc tac ccc ate ccg gee aac gee acc aac gtg ccc 20249 
Ala Asn Met Leu Tyr Pro lie Pro Ala Asn Ala Thr Asn Val Pro 
1165 1170 1175 

ate tec ate ccc teg cgc aac tgg gee gee ttc cgc gga tgg tec 20294 
lie Ser lie Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser 
1180 1185 1190 
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ttc acg cgc etc aag 
Phe Thr Arg Leu Lys 
1195 



acc cgc gag acg ccc teg etc ggc tec ggg 
Thr Arg Glu Thr Pro Ser Leu Gly Ser Gly 
1200 1205 



20339 



ttc gac ccc tac ttc gtc tac teg ggc tec ate ccc tac etc gac 
Phe Asp Pro Tyr Phe Val Tyr Ser Gly Ser lie Pro Tyr Leu Asp 
1210 1215 1220 



20384 



ggc acc ttc tac etc aac cac acc ttc aag aag gtc tec ate acc 
Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser He Thr 
1225 1230 1235 



20429 



ttc gac tec tec gtc age tgg ccc ggc aac gac cgc etc ctg acg 
Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr 
1240 1245 1250 



20474 



ccc aac gag ttc gaa 
Pro Asn Glu Phe Glu 
1255 



ate aag cgc acc gtc gac gga gag ggg tac 
He Lys Arg Thr Val Asp Gly Glu Gly Tyr 
1260 " 1265 



20519 



aac gtg gee cag tgc aac atg acc aag gac tgg ttc ctg gtc cag 
Asn Val Ala Gin Cys Asn Met Thr Lys Asp Trp Phe Leu Val Gin 
1270 1275 1280 



20564 



atg ctg gee cac tac aac ate ggc tac cag ggc ttc tac gtg ccc 
Met Leu Ala His Tyr Asn He Gly Tyr Gin Gly Phe Tyr Val Pro 
1285 1290 ~ ^ 1295 



20609 



gag ggc tac aag gac cgc atg tac tec ttc ttc cgc aac ttc cag 
Glu Gly Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gin 
1300 1305 1310 



20654 



ccc atg age cgc cag 
Pro Met Ser Arg Gin 
1315 



gtc gtg gac gag gtc aac tac aag gac tac 
Val Val Asp Glu Val Asn Tyr Lys Asp Tyr 
1320 1325 



20699 



cag gee gtc acc ctg gee tac cag cac aac aac teg ggc ttc gtc 
Gin Ala Val Thr Leu Ala Tyr Gin His Asn Asn Ser Gly Phe Val 
1330 1335 ' 1340 



20744 



ggc tac etc gcg ccc 
Gly Tyr Leu Ala Pro 
1345 



acc atg cgc cag gga cag ccc tac ccc gee 
Thr Met Arg Gin Gly Gin Pro Tyr Pro Ala 
1350 1355 



20789 



aac tac ccc tac ccg etc ate ggc aag age gee gtc gee age gtc 
Asn Tyr Pro Tyr Pro Leu He Gly Lys Ser Ala Val Ala Ser Val 
1360 1365 1370 



20834 



acc cag aaa aag ttc 
Thr Gin Lys Lys Phe 
1375 



etc tgc gac egg gtc atg tgg cgc ate ccc 
Leu Cys Asp Arg Val Met Trp Arg He Pro 
1380 1385 



20879 



ttc tec age aac 
Phe Ser Ser Asn 



ttc atg tec atg ggc gcg etc acc gac etc ggc 
Phe Met Ser Met Gly Ala Leu Thr Asp Leu Gly 
1390 ~ 1395 4 1400 



20924 
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cag aac atg etc tac gcc aac tec gcc cac gcg eta gac atg aat 20969 
Gin Asn Met Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met Asn 
1405 1410 ~ 1415 

ttc gaa gtc gac ccc atg gat gag tec ace ctt etc tat gtt gtc 21014 
Phe Glu Val Asp Pro Met Asp Glu Ser Thr Leu Leu Tyr Val Val 
1420 1425 ~* 1430 

ttc gaa gtc ttc gac gtc gtc cga gtg cac cag ccc cac cgc ggc 21059 
Phe Glu Val Phe Asp Val Val Arg Val His Gin Pro His Arg Gly 
1435 1440 1445 

gtc ate gag gcc gtc tac ctg cgc acg ccc ttc teg gcc ggc aac 21104 
Val He Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly Asn 
1450 1455 1460 

gcc ace acc taa gccccgctct tgcttcttgc aagatgaegg cctgtgcggg 21156 
Ala Thr Thr 



ctccggcgag caggagctca gggccatcct ccgcgacctg ggctgeggge cctgcttcct 21216 

gggcaccttc gaeaageget tecegggatt catggccccg cacaagctgg cctgcgccat 21276 

cgtcaacacg gccggccgcg agaceggggg cgagcactgg ctggccttcg cctggaaccc 21336 

gcgctcccac acctgctacc tcttcgaccc ettegggtte teggacgage gcctcaagca 21396 

gatctaccag ttcgagtacg agggectget gcgccgcagc gccctggcca ccgaggaccg 21456 

ctgcgtcacc ctggaaaagt ccacccagac cgtgcagggt ccgcgctcgg ccgcctgcgg 21516 

gctcttctgc tgcatgttcc tgcacgcctt cgtgcactgg cccgaccgcc ccatggacaa 21576 

gaaccccacc atgaacttgc tgacgggggt gcccaacggc atgctccagt cgccccaggt 21636 

ggaacccacc ctgcgccgca accaggaggc gctctaccgc ttcctcaacg cccactccgc 21696 

ctactttcgc tcccaccgcg cgcgcatcga gaaggccacc gccttcgacc gcatgaatca 21756 

agacatgtaa accgtgtgtg tatgtgaatg ctttattcat aataaacagc acatgtttat 21816 

gccacctttt ctgaggctct gactttattt agaaatcgaa ggggttctgc cggctctcgg 21876 

cgtgccccgc gggcagggat acgttgcgga actggtactt gggcagccac ttgaactegg 21936 

ggatcagcag cttcggcacg gggaggtegg ggaacgagtc gctccacagc ttgcgcgtga 21996 

gttgeaggge gcccagcagg tegggegegg agatcttgaa atcgcagttg ggacccgcgt 22056 

tctgcgcgcg ggagttgcgg tacaeggggt tgcagcactg gaacaccatc agggcegggt 22116 

gcttcacgct cgccagcacc gtcgegtegg tgatgccctc cacgtccaga tcctcggcgt 22176 

tggccatccc gaagggggtc atettgeagg tctgccgccc catgetggge acgcagccgg 22236 

gcttgtggtt geaategcag tgcaggggga tcagcatcat ctgggcctgc teggagctea 22296 
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tgcccgggta catggccttc 


atgaaagcct 


ccagctggcg 


gaaggcctgc 


tgcgccttgc 


22356 


cgccctcggt gaagaagacc 


ccgcaggact 


tgctagagaa 


ctggttggtg 


gcgcagccgg 


22416 


cgtcgtgcac gcagcagcgc 


gcgtcgttgt 


tggccagctg 


caccacgctg 


cgcccccagc 


22476 


ggttctgggt gatcttggcc 


cggtcggggt 


tctccttcag 


cgcgcgctgc 


ccgttctcgc 


22536 


tcgccacatc catctcgatc 


gtgtgctcct 


tctggatcat 


cacggtcccg 


tgcaggcatc 


22596 


gcagcttgcc ctcggcctcg 


gtgcacccgt 


gcagccacag 


cgcgcagccg 


gtgcactccc 


22656 


agttcttgtg ggcgatctgg 


gagtgcgagt 


gcacgaagcc 


ctgcaggaag 


cggcccatca 


22716 


tcgtggtcag ggtcttgttg 


ctggtgaagg 


tcagcgggat 


gccgcggtgc 


tcctcgttca 


22776 


catacaggtg gcagatgcgg 


cggtacacct 


cgccctgctc 


gggcatcagc 


tggaaggcgg 


22836 


acttcaggtc gctctccacg 


cggtaccggt 


ccatcagcag 


cgtcatgact 


tccatgccct 


22896 


tctcccaggc cgagacgatc 


ggcaggctca 


gggggttctt 


caccgccgtt 


gtcatcttag 


22956 


tcgccgccgc tgaggtcagg 


gggtcgttct 


cgtccagggt 


ctcaaacact 


cgcttgccgt 


23016 


ccttctcggt gatgcgcacg 


gggggaaagc 


tgaagcccac 


ggccgccagc 


tcctcctcgg 


23076 


cctgcctttc gtcctcgctg 


tcctggctga 


tgtcttgcaa 


aggcacatgc 


ttggtcttgc 


23136 


ggggtttctt tttgggcggc 


agaggcggcg 


gcggagacgt 


gctgggcgag 


cgcgagttct 


23196 


cgctcaccac gactatttct 


tcttcttggc 


cgtcgtccga 


gaccacgcgg 


cggtaggcat 


23256 


gcctcttctg gggcagaggc 


ggaggcgacg 


ggctctcgcg 


gttcggcggg 


cggctggcag 


23316 


agccccttcc gcgttcgggg 


gtgcgctcct 


ggcggcgctg 


ctctgactga 


cttcctccgc 


23376 


ggccggccat tgtgttctcc 


tagggagcaa 


caagcatgga 


gactcagcca 


tcgtcgccaa 


23436 


catcgccatc tgcccccgcc 


gccgccgacg 


agaaccagca 


gcagaatgaa 


agcttaaccg 


23496 


ccccgccgcc cagccccacc 


tccgacgccg 


ccgcggcccc 


agacatgcaa 


gagatggagg 


23556 


aatccatcga gattgacctg 


ggctacgtga 


cgcccgcgga 


gcacgaggag 


gagctggcag 


23616 


cgcgcttttc agccccggaa 


gagaaccacc 


aagagcagcc 


agagcaggaa 


gcagagagcg 


23676 


agcagcagca ggctgggctc 


gagcatggcg 


actacctgag 


cggggcagag 


gacgtgctca 


23736 


tcaagcatct ggcccgccaa 


tgcatcatcg 


tcaaggacgc 


gctgctcgac 


cgcgccgagg 


23796 


tgcccctcag cgtggcggag 


ctcagccgcg 


cctacgagcg 


caacctcttc 


tcgccgcgcg 


23856 


tgccccccaa gcgccagccc 


aacggcacct 


gcgagcccaa 


cccgcgcctc 


aacttctacc 


23916 


cggtcttcgc ggtgcccgag 


gccctggcca 


cctaccacct 


ctttttcaag 


aaccaaagga 


23976 
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tccccgtctc ctgccgcgcc aaccgcaccc gcgccgacgc cctgctcaac ctgggtcccg 24036 

gcgcccgcct acctgatatc gcctccttgg aagaggttcc caagatcttc gagggtctgg 24096 

gcagcgacga gactcgggcc gcgaacgctc tgcaaggaag cggagaggag catgagcacc 24156 

acagcgccct ggtggagttg gaaggcgaca acgcgcgcct ggcggtgctc aagcgcacgg 24216 

tcgagctgac ccacttcgcc tacccggcgc tcaacctgcc ccccaaggtc atgagcgccg 24276 

tcatggacca ggtgctcatc aagcgcgcct cgcccctctc ggatgaggac atgcaggacc 24336 

ccgagagctc ggacgagggc aagcccgtgg tcagcgacga gcagctggcg cgctggctgg 24396 

gagcgagtag caccccccag agcttggaag agcggcgcaa gctcatgatg gccgtggtcc 24456 

tggtgaccgt ggagctggag tgtctgcgcc gcttcttcgc cgacgcagag accctgcgca 24516 

aggtcgagga gaacctgcac tacctcttca ggcacgggtt tgtgcgccag gcctgcaaga 24576 

tctccaacgt ggagctgacc aacctggtct cctacatggg catcctgcac gagaaccgcc 24636 

tggggcagaa cgtgctgcac accaccctgc gcggggaggc ccgccgcgac tacatccgcg 24696 

actgcgtcta cctgtacctc tgccacacct ggcagacggg catgggcgtg tggcagcagt 24756 

gcctggagga gcagaacctg aaagagctct gcaagctcct gcagaagaac ctgaaggccc 24816 

tgtggaccgg gttcgacgag cgcaccaccg cctcggacct ggccgacctc atcttccccg 24876 

agcgcctgcg gctgacgctg cgcaacggac tgcccgactt tatgagtcaa agcatgttgc 24936 

aaaactttcg ctctttcatc ctcgaacgct ccgggatcct gcccgccacc tgctccgcgc 24996 

tgccctcgga cttcgtgccg ctgaccttcc gcgagtgccc cccgccgctc tggagccact 25056 

gctacctgct gcgcctggcc aactacctgg cctaccactc ggacgtgatc gaggacgtca 25116 

gcggcgaggg tctgctcgag tgccactgcc gctgcaacct ctgcacgccg caccgctccc 25176 

tggcctgcaa cccccagctg ctgagcgaga cccagatcat cggcaccttc gagttgcaag 25236 

gccccggcga gggcaagggg ggtctgaaac tcaccccggg gctgtggacc tcggcctact 25296 

tgcgcaagtt cgtgcccgag gactaccatc ccttcgagat caggttctac gaggaccaat 25356 

cccagccgcc caaggccgaa ctgtcggcct gcgtcatcac ccagggggcc atcctggccc 25416 

aattgcaagc catccagaaa tcccgccaag aatttctgct gaaaaagggc cacggggtct 25476 

acctggaccc ccagaccgga gaggagctca accccagctt cccccaggat gccccgagga 25536 

agcagcaaga agctgaaagt ggagctgccg ccgccggagg atttggagga agactgggag 25596 

agcagtcagg cagaggagga ggagatggaa gactgggaca gcactcaggc agaggaggac 25656 

agcctgcaag acagtctgga agacgaggtg gaggaggagg cagaggaaga agcagccgcc 25716 
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gccagaccgt 


cgtcctcggc 


ggagaaagca 


agcagcacgg 


ataccatctc 


cgctccgggt 


25776 


cggggtcgcg 


gcgaccgggc 


ccacagtagg 


tgggacgaga 


ccgggcgctt 


cccgaacccc 


25836 


accacccaga 


ccggtaagaa 


ggagcggcag 


ggatacaagt 


cctggcgggg 


gcacaaaaac 


25896 


gccatcgtct 


cctgcttgca 


agcctgcggg 


ggcaacatct 


ccttcacccg 


ccgctacctg 


25956 


ctcttccacc 


gcggggtgaa 


cttcccccgc 


aacatcttgc 


attactaccg 


tcacctccac 


26016 


agcccctact 


actgtttcca 


agaagaggca 


gaaacccagc 


agcagcagaa 


aaccagcggc 


26076 


agcagcagct 


agaaaatcca 


cagcggcggc 


aggtggactg 


aggatcgcag 


cgaacgagcc 


26136 


ggcgcagacc 


cgggagctga 


ggaaccggat 


ctttcccacc 


ctctatgcca 


tcttccagca 


26196 


gagtcggggg 


caggagcagg 


aactgaaagt 


caagaaccgt 


tctctgcgct 


cgctcacccg 


26256 


cagttgtctg 


tatcacaaga 


gcgaagacca 


acttcagcgc 


actctcgagg 


acgccgaggc 


26316 


tctcttcaac 


aagtactgcg 


cgctcactct 


taaagagtag 


cccgcgcccg 


cccacacacg 


26376 


gaaaaaggcg 


ggaattacgt 


caccacctgc 


gcccttcgcc 


cgaccatcat 


catgagcaaa 


26436 


gagattccca 


cgccttacat 


gtggagctac 


cagccccaga 


tgggcctggc 


cgccggcgcc 


26496 


gcccaggact 


actccacccg 


catgaactgg 


ctcagcgccg 


ggcccgcgat 


gatctcacgg 


26556 


gtgaatgaca 


tccgcgcccg 


ccgaaaccag 


atactcctag 


aacagtcagc 


gatcaccgcc 


26616 


acgccccgcc 


atcaccttaa 


tccgcgtaat 


tggcccgccg 


ccctggtgta 


ccaggaaatt 


26676 


ccccagccca 


cgaccgtact 


acttccgcga 


gacgcccagg 


ccgaagtcca 


gctgactaac 


26736 


tcaggtgtcc 


agctggccgg 


cggcgccgcc 


ctgtgtcgtc 


accgccccgc 


tcagggtata 


26796 


aagcggctgg 


tgatccgagg 


cagaggcaca 


cagctcaacg 


acgaggtggt 


gagctcttcg 


26856 


ctgggtctgc 


gacctgacgg 


agtcttccaa 


ctcgccggat 


cggggagatc 


ttccttcacg 


26916 


cctcgtcagg 


ccgtcctgac 


tttggagagt 


tcgtcctcgc 


agccccgctc 


gggtggcatc 


26976 


ggcactctcc 


agttcgtgga 


ggagttcact 


ccctcggtct 


acttcaaccc 


cttctccggc 


27036 


tcccccggcc 


actacccgga 


cgagttcatc 


ccgaacttcg 


acgccatcag 


cgagtcggtg 


27096 


gacggctacg 


attgaatgtc 


ccatggtggc 


gcagctgacc 


tagctcggct 


tcgacacctg 


27156 


gaccactgcc 


gccgcttccg 


ctgcttcgct 


cgggatctcg 


ccgagtttgc 


ctactttgag 


27216 


ctgcccgagg 


agcaccctca 


gggcccggcc 


cacggagtgc 


ggatcatcgt 


cgaagggggc 


27276 


ctcgactccc 


acctgcttcg 


gatcttcagc 


cagcgaccga 


tcctggtcga 


gcgcgagcaa 


27336 


ggacagaccc 


ttctgaccct 


gtactgcatc 


tgcaaccacc 


ccggcctgca 


tgaaagtctt 


27396 
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tgttgtctgc tgtgtactga gtataataaa 


agctgagatc 


agcgactact 


ccggactcga 


27456 


ttgtggtgtt cctgctatca accggtccct 


gttcttcacc 


gggaacgaga 


ccgagctcca 


27516 


gcttcagtgt aagccccaca agaagtacct 


cacctggctg 


ttccagggct 


ccccgatcgc 


27576 


cgttgtcaac cactgcgaca acgacggagt 


cctgctgagc 


ggccccgcca 


accttacttt 


27636 


ttccacccgc agaagcaagc tccagctctt 


ccaacccttc 


ctccccggga 


cctatcagtg 


27696 


cgtctcggga ccctgccatc acaccttcca 


cctgatcccg 


aataccacag 


cgccgctccc 


27756 


cgctactaac aaccaaacta cccaccatcg 


ccaccgtcgc 


gacctttctg 


aatctaacac 


27816 


taccacccac accggaggtg agctccgagg 


tcgaccaacc 


tctgggattt 


actacggccc 


27876 


ctgggaggtg gtggggttaa tagcgctagg 


cctagttgtg 


ggtgggcttt 


tggctctctg 


27936 


ctacctatac ctcccttgct gttcgtactt 


agtggtgctg 


tgttgctggt 


ttaagaaatg 


27996 


gggaagatca ccctagtgag ctgcggtgcg 


ctggtggcgg 


tggtggtgtt 


ttcgattgtg 


28056 


ggactgggcg gcgcggctgt agtgaaggag 


aaggccgatc 


cctgcttgca 


tttcaatccc 


28116 


gacaattgcc agctgagttt tcagcccgat 


ggcaatcggt 


gcgcggtgct 


gatcaagtgc 


28176 


ggatgggaat gcgagaacgt gagaatcgag 


tacaataaca 


agactcggaa 


caatactctc 


28236 


gcgtccgtgt ggcagcccgg ggaccccgag 


tggtacaccg 


tctctgtccc 


cggtgctgac 


28296 


ggctccccgc gcaccgtgaa caatactttc 


atttttgcgc 


acatgtgcga 


cacggtcatg 


28356 


tggatgagca agcagtacga tatgtggccc 


cccacgaagg 


agaacatcgt 


ggtcttctcc 


28416 


atcgcttaca gcgcgtgcac ggcgctaatc 


accgctatcg 


tgtgcctgag 


cattcacatg 


28476 


ctcatcgcta ttcgccccag aaataatgcc 


gaaaaagaga 


aacagccata 


acacgttttt 


28536 


tcacacacct ttttcagacc atggcctctg 


ttaaattttt 


gcttttattt 


gccagtctca 


28596 


ttactgttat aagtaatgag aaactcacta 


tttacattgg 


cactaaccac 


actttagacg 


28656 


gaattccaaa atcctcatgg tattgctatt 


ttgatcaaga 


tccagactta 


actatagaac 


28716 


tgtgtggtaa caagggaaaa aatacaagca 


ttcatttaat 


taactttaat 


tgcggagaca 


28776 


atttgaaatt aattaatatc actaaagagt 


atggaggtat 


gtattactat 


gttgcagaaa 


28836 


ataacaacat gcagttttat gaagttactg 


taactaatcc 


caccacacct 


agaacaacaa 


28896 


caaccaccac cacaaaaact acacctgtta 


ccactatgca 


gctcactacc 


aataacattt 


28956 


ttgccatgcg tcaaatggtc aacaatagca 


ctcaacccac 


cccacccagt 


gaggaaattc 


29016 


ccaaatccat gattggcatt attgttgctg 


tagtggtgtg 


catgttgatc 


atcgccttgt 


29076 


gcatggtgta ctatgccttc tgctacagaa 


agcacagact 


gaacgacaag 


ctggaacact 


29136 
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tactaagtgt tgaattttaa ttttttagaa ccatgaagat cctaggcctt ttaatttttt 29196 

ctatcattac ctctgctcta tgcaattctg acaatgagga cgttactgtc gttgtcggaa 29256 

ccaattatac actgaaaggt ccagcgaagg gtatgctttc gtggtattgc tggtttggaa 29316 

ctgacgagca acagacagag ctctgcaatg ctcaaaaagg caaaacctca aattctaaaa 29376 

tctctaatta tcaatgcaat ggcactgact tagtactgct caatgtcacg aaagcatatg 29436 

ctggcagcta cacctgccct ggagatgata ctgagaacat gattttttac aaagtggaag 29496 

tggttgatcc cactactcca cctccaccca ccacaactac tcacaccaca cacacagaac 29556 

aaaccacagc agaggaggca gcaaagttag ccttgcaggt ccaagacagt tcatttgttg 29616 

gcattacccc tacacctgat cagcggtgtc cggggctgct cgtcagcggc attgtcggtg 29676 

tgctttcggg attagcagtc ataatcatct gcatgttcat ttttgcttgc tgctatagaa 29736 

ggctttaccg acaaaaatca gacccactgc tgaacctcta tgtttaattt tttccagagc 29796 

catgaaggca gttagcactc tagttttttg ttctttgatt ggcactgttt ttagtgttag 29856 

ctttttgaaa caaatcaatg ttactgaggg ggaaaatgtg acactggtag gcgtagaggg 29916 

tgctcaaaat accacctgga caaaattcca tctagatggg tggaaagaaa tttgcacctg 29976 

gaatgtcagt acttatacat gtgaaggagt taatcttacc attgtcaatg tcagccaaat 30036 

tcaaaagggt tggattaaag ggcaatctgt tagtgttagc aatagtgggt actataccca 30096 

gcatactctt atctatgaca ttatagttat accactgcct acacctagcc cacctagcac 30156 

taccacacag acaacccaca ctacacaaac aaccacatac agtacatcaa atcagcctac 30216 

caccactaca acagcagagg ttgccagctc gtctggggtc cgagtggcat ttttgatgtt 30276 

ggccccatct agcagtccca ctgctagtac caatgagcag actactgaat ttttgtccac 30336 

tgtcgagagc cacaccacag ctacctcgag tgccttctct agcaccgcca atctatcctc 30396 

gctttcctct acaccaatca gtcccgctac tactcctacc cccgctattc tccccactcc 30456 

cctgaagcaa acagacggcg acatgcaatg gcagatcacc ctgctcattg tgatcgggtt 30516 

ggtcatcctg gccgtgttgc tctactacat cttctgccgc cgcattccca acgcgcaccg 30576 

caagccggcc tacaagccca tcgttgtcgg gcagccggag ccgcttcagg tggaaggggg 30636 

tctaaggaat cttctcttct cttttacagt atggtgattg aattatgatt cctagacaaa 30696 

tcttgatcac tattcttatc tgcctcctcc aagtctgtgc caccctcgct ctggtggcca 30756 

acgccagtcc agactgtatt gggcccttcg cctcctacgt gctctttgcc ttcatcacct 30816 
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gcatctgctg 


ctgtagcata 


gtctgcctgc ttatcacctt 


cttccagttc 


attgactgga 


30876 


tctttgtgcg 


catcgcctac 


ctgcgccacc 


acccccagta 


ccgcgaccag 


cgagtggcgc 


30936 


ggctgctcag 


gatcctctga 


taagcatgcg 


ggctctgeta 


cttctcgcgc 


ttctgctgtt 


30996 


agtgctcccc 


cgtcccgtcg 


acccccggac 


ccccacccag 


tcccccgagg 


aggtcegcaa 


31056 


atgcaaattc 


caagaaccct 


ggaaattcct 


caaatgetae 


cgccaaaaat 


cagacatgea 


31116 


tcccagctgg 


atcatgatca 


ttgggatcgt 


gaacattctg 


gcctgcaccc 


tcatctcctt 


31176 


tgtgatttac 


ccctgctttg 


actttggttg 


gaactcgcca 


gaggegctet 


atctcccgcc 


31236 


tgaacctgac 


acaccaccac 


agcaacctca 


ggcacacgca 


ctaccaccac 


caccacagcc 


31296 


taggccacaa 


tacatgccca 


tattagacta tgaggecgag 


ccacagcgac 


ccatgctccc 


31356 


cgctattagt 


tacttcaatc 


taaceggegg agatgactga 


cccactggcc 


aacaacaacg 


31416 


tcaacgacct 


tctcctggac 


atggacggcc 


gcgcctcgga 


gcagcgactc 


gcccaacttc 


31476 


gcattcgcca 


gcagcaggag 


agagcegtea 


aggagctgea 


ggaeggcata 


gccatccacc 


31536 


agtgcaagaa 


aggcatcttc 


tgcctggtga 


aacaggecaa 


gatctcctac 


gaggtcaccc 


31596 


agaccgacca 


tcgcctctcc 


tacgagctcc 


tgcagcagcg 


ccagaagttc 


acctgcctgg 


31656 


tcggagtcaa 


ccccatcgtc 


atcacccagc 


agtegggega 


taccaagggg 


tgcatccact 


31716 


gctcctgcga 


ctcccccgac 


tgcgtccaca 


ctctgatcaa 


gaccctctgc 


ggcctccgcg 


31776 


acctcctccc 


catgaactaa 


tcaccccctt 


atccagtgaa 


ataaagatca 


tattgatgat 


31836 


ttgagtttaa 


taaaaataaa 


gaatcactta 


cttgaaatct 


gataccaggt 


ctctgtccat 


31896 


gttttctgcc 


aacaccactt 


cactcccctc 


ttcccagctc 


tggtactgea 


ggccccggcg 


31956 


ggctgcaaac 


ttcctccaca 


ccctgaaggg 


gatgtcaaat 


tcctcctgtc 


cctcaatctt 


32016 


cattttatct 


tctatcag atg tec aaa 


aag cgc gtc 


: C( 3<3 <Jtg gat gat gac 


32067 



Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp 
1465 1470 



ttc gac ccc gtc tac ccc tac gat gca gac aac gca ccg acc gtg 32112 

Phe Asp Pro Val Tyr Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val 

1475 1480 1485 

ccc ttc ate aac ccc ccc ttc gtc tct tea gat gga ttc caa gag 32157 

Pro Phe lie Asn Pro Pro Phe Val Ser Ser Asp Gly Phe Gin Glu 

1490 1495 1500 

aag ccc ctg ggg gtg ctg tec ctg cgt ctg gee gat ccc gtc acc 32202 

Lys Pro Leu Gly Val Leu Ser Leu Arg Leu Ala Asp Pro Val Thr 

1505 1510 1515 
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acc aag aac ggg gaa ate acc etc aag ctg gga gat ggg gtg gac 
Thr Lys Asn Gly Glu lie Thr Leu Lys Leu Gly Asp Gly Val Asp 
1520 1525 1530 



32247 



etc gac tec teg gga aaa etc ate tec aac acg gee acc aag gee 
Leu Asp Ser Ser Gly Lys Leu He Ser Asn Thr Ala Thr Lys Ala 
1535 1540 1545 



32292 



gee gee cct etc agt ttt 
Ala Ala Pro Leu Ser Phe 
1550 1555 



tec aac aac acc att tec ctt aac atg 
Ser Asn Asn Thr He Ser Leu Asn Met 
1560 



32337 



gat acc cct ttt tac aac aac aat gga aag tta 
Asp Thr Pro Phe Tyr Asn Asn Asn Gly Lys Leu 
1565 1570 1575 



ggc atg aaa gtc 
Gly Met Lys Val 



32382 



act get cca ctg aag ata eta gac aca gac ttg eta aaa aca ctt 
Thr Ala Pro Leu Lys He Leu Asp Thr Asp Leu Leu Lys Thr Leu 
1580 1585 1590 



32427 



gtt gta get tat gga caa ggt tta gga aca aac acc act ggt gee 
Val Val Ala Tyr Gly Gin Gly Leu Gly Thr Asn Thr Thr Gly Ala 
1595 1600 1605 



32472 



ctt gtt gee caa eta gca tec cca ctt get ttt gat age aat age 
Leu Val Ala Gin Leu Ala Ser Pro Leu Ala Phe Asp Ser Asn Ser 
1610 1615 1620 



32517 



aaa att gec ctt aat tta ggc aat gga cca ttg 
Lys He Ala Leu Asn Leu Gly Asn Gly Pro Leu 
1625 1630 ~ ~ 1635 



aaa gtg gat gca 
Lys Val Asp Ala 



32562 



aat aga ctg aac ate aat tgc aat aga gga etc tat gtt act acc 
Asn Arg Leu Asn He Asn Cys Asn Arg Gly Leu Tyr Val Thr Thr 
1640 1645 1650 



32607 



aca aaa gat gca ctg gaa gee aat ata agt tgg get aat get atg 
Thr Lys Asp Ala Leu Glu Ala Asn He Ser Trp Ala Asn Ala Met 
1655 1660 1665 



32652 



aca ttt ata gga aat gee atg ggt gtc aat att gat aca caa aaa 
Thr Phe He Gly Asn Ala Met Gly Val Asn lie Asp Thr Gin Lys 
1670 1675 1680 



32697 



ggc ttg caa ttt ggc acc act agt acc gtc gca gat gtt aaa aac 
Gly Leu Gin Phe Gly Thr Thr Ser Thr Val Ala Asp Val Lys Asn 
1685 1690 1695 



32742 



get tac ccc ata caa ate aaa ctt gga get ggt etc aca ttt gac 
Ala Tyr Pro He Gin He Lys Leu Gly Ala Gly Leu Thr Phe Asp 
1700 1705 1710 



32787 



age aca ggt gca att gtt gca tgg aac aaa gat 
Ser Thr Gly Ala lie Val Ala Trp Asn Lys Asp 
1715 1720 1725 



gat gac aag ctt 
Asp Asp Lys Leu 



32832 
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aca eta tgg acc aca gee gac ccc tct cca aat tgt cac ata tat 
Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys His lie Tyr 
1730 1735 1740 

tct gaa aag gat get aag ctt aca ctt tgc ttg aca aag tgt ggc 
Ser Glu Lys Asp Ala Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly 
1745 1750 ^ 1755 

agt cag att ctg ggc act gtt tec etc ata get gtt gat act ggc 
Ser Gin lie Leu Gly Thr Val Ser Leu lie Ala Val Asp Thr Gly 

1770 



1760 



1765 



agt tta aat ccc ata aca gga aca gta acc act get ctt gtc tea 

Ser Leu Asn Pro He Thr Gly Thr Val Thr Thr Ala Leu Val Ser 

1775 1780 1785 

ctt aaa ttc gat gca aat gga gtt ttg caa age age tea aca eta 

Leu Lys Phe Asp Ala Asn Gly Val Leu Gin Ser Ser Ser Thr Leu 

1790 1795 1800 

gac tea gac tat tgg aat ttc aga cag gga gat gtt aca cct get 

Asp Ser Asp Tyr Trp Asn Phe Arg Gin Gly Asp Val Thr Pro Ala 

1805 1810 1815 

gaa gee tat act aat get ata ggt ttc atg ccc aat eta aaa gca 

Glu Ala Tyr Thr Asn Ala He Gly Phe Met Pro Asn Leu Lys Ala 

1820 1825 1830 

tac cct aaa aac aca agt gga get gca aaa agt cac att gtt ggg 

Tyr Pro Lys Asn Thr Ser Gly Ala Ala Lys Ser His He Val Gly 



1835 



1840 



1845 



aaa gtg tac eta cat ggg gat aca ggc aaa cca ctg gac etc att 

Lys Val Tyr Leu His Gly Asp Thr Gly Lys Pro Leu Asp Leu He 

1850 1855 I860 

att act ttc aat gaa aca agt gat gaa tct tgc act tac tgt att 

He Thr Phe Asn Glu Thr Ser Asp Glu Ser Cys Thr Tyr Cys He 

1865 1870 1875 

aac ttt caa tgg cag tgg ggg get gat caa tat aaa aat gaa aca 

Asn Phe Gin Trp Gin Trp Gly Ala Asp Gin Tyr Lys Asn Glu Thr 

1880 1885 1890 

ctt gee gtc agt tea ttc acc ttt tec tat att get aaa gaa taa 

Leu Ala Val Ser Ser Phe Thr Phe Ser Tyr He Ala Lys Glu 



1895 



1900 



1905 



accccactct gtaccccatc tctgtctatg gaaaaaactc tgaaacacaa aataaaataa 
agttcaagtg ttttattgat tcaacagttt tacaggattc gagcagttat ttttcctcca 
ccctcccagg acatggaata caccaccctc tccccccgca cagecttgaa catctgaatg 
ccattggtga tggacatget tttggtctcc acgttccaca cagtttcaga gcgagccagt 
ctegggtegg tcagggagat gaaaccctcc gggcactccc gcatctgcac ctcacagctc 



32877 

32922 

32967 

33012 

33057 

33102 

33147 

33192 

33237 

33282 

33327 

33372 

33432 
33492 
33552 
33612 
33672 
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aacagctgag gattgtcctc ggtggtcggg atcacggtta tctggaagaa gcagaagagc 33732 
ggcggtggga atcatagtcc gcgaacggga tcggccggtg gtgtcgcatc aggccccgca 33792 
gcagtcgctg tcgccgccgc tccgtcaagc tgctgctcag ggggtccggg tccagggact 33852 
ccctcagcat gatgcccacg gccctcagca tcagtcgtct ggtgcggcgg gcgcagcagc 33912 
gcatgcggat ctcgctcagg tcgctgcagt acgtgcaaca caggaccacc aggttgttca 33972 
acagtccata gttcaacacg ctccagccga aactcatcgc gggaaggatg ctacccacgt 34032 
ggccgtcgta ccagatcctc aggtaaatca agtggcgccc cctccagaac acgctgccca 34092 
tgtacatgat ctccttgggc atgtggcggt tcaccacctc ccggtaccac atcaccctct 34152 
ggttgaacat gcagccccgg atgatcctgc ggaaccacag ggccagcacc gccccgcccg 34212 
ccatgcagcg aagagacccc gggtcccgac aatggcaatg gaggacccac cgctcgtacc 34272 
cgtggatcat ctgggagctg aacaagtcta tgttggcaca gcacaggcat atgctcatgc 34332 
atctcttcag cactctcagc tcctcggggg tcaaaaccat atcccagggc acggggaact 34392 
cttgcaggac agcgaacccc gcagaacagg gcaatcctcg cacataactt acattgtgca 34452 
tggacagggt atcgcaatca ggcagcaccg ggtgatcctc caccagagaa gcgcgggtct 34512 
cggtctcctc acagcgtggt aagggggccg gccgatacgg gtgatggcgg gacgcggctg 34572 
atcgtgttcg cgaccgtgtt atgatgcagt tgctttcgga cattttcgta cttgctgtag 34632 
cagaacctgg tccgggcgct gcacaccgat cgccggcggc ggtcccggcg cttggaacgc 34692 
tcggtgttga agttgtaaaa cagccactct ctcagaccgt gcagcagatc tagggcctca 34752 
ggagtgatga agatcccatc atgcctgatg gctctaatca catcgaccac cgtggaatgg 34812 
gccagaccca gccagatgat gcaattttgt tgggtttcgg tgacggcggg ggagggaaga 34872 
acaggaagaa ccatgattaa cttttaatcc aaacggtctc ggagcacttc aaaatgaaga 34932 
tcgcggagat ggcacctctc gcccccgctg tgttggtgga aaataacagc caggtcaaag 34992 
gtgatacggt tctcgagatg ttccacggtg gcttccagca aagcctccac gcgcacatcc 35052 
agaaacaaga caatagcgaa agcgggaggg ttctctaatt cctcaatcat catgttacac 35112 
tcctgcacca tccccagata attttcattt ttccagcctt gaatgattcg aactagttcc 35172 
tgaggtaaat ccaagccagc catgataaag agctcgcgca gagcgccctc caccggcatt 35232 
cttaagcaca ccctcataat tccaagatat tctgctcctg gttcacctgc agcagattga 35292 
caagcggaat atcaaaatct ctgccgcgat ccctaagctc ctccctcagc aataactgta 35352 
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agtactcttt 


catatcctct 


ccgaaatttt 


tagccatagg 


accaccagga 


ataagattag 


35412 


ggcaagccac 


agtacagata 


aaccgaagtc 


ctccccagtg 


agcattgcca 


aatgcaagac 


35472 


tgctataagc 


atgctggcta 


gacccggtga 


tatcttccag 


ataactggac 


agaaaatcgc 


35532 


ccaggcaatt 


tttaagaaaa 


tcaacaaaag 


aaaaatcctc 


caggtgcacg tttagagcct 


35592 


cgggaacaac 


gatggagtaa 


atgcaagcgg 


tgcgttccag 


catggttagt 


tagctgatct 


35652 


gtagaaaaaa 


acaaaaatga 


acattaaacc 


atgctagcct 


ggcgaacagg 


tgggtaaatc 


35712 


gttctctcca 


gcaccaggca 


ggccacgggg 


tctccggcac 


gaccctcgta 


aaaattgtcg 


35772 


ctatgattga 


aaaccatcac 


agagagacgt 


tcccggtggc 


cggcgtgaat 


gattcgacaa 


35832 


gatgaataca 


cccccggaac 


attggcgtcc 


gcgagtgaaa 


aaaagcgccc 


aaggaagcaa 


35892 


taaggcacta 


caatgctcag 


tctcaagtcc 


agcaaagcga 


tgccatgcgg 


atgaagcaca 


35952 


aaattctcag 


gtgcgtacaa 


aatgtaatta 


ctcccctcct 


gcacaggcag 


caaagccccc 


36012 


gatccctcca 


ggtacacata 


caaagcctca 


gcgtccatag 


cttaccgagc 


agcagcacac 


36072 


aacaggcgca 


agagtcagag 


aaaggctgag 


ctctaacctg 


tccacccgct 


ctctgctcaa 


36132 


tatatagccc 


agatctacac 


tgacgtaaag 


gccaaagtct 


aaaaataccc 


gccaaataat 


36192 


cacacacgcc 


cagcacacgc 


ccagaaaccg 


gtgacacact 


caaaaaaata 


cgcgcacttc 


36252 


ctcaaacgcc 


caaactgccg 


t cafcfctcccrcr 

w v^ti L~ I— ' V-*- V«J KJ 




tacgtcatca 


aaattcgact 




ttcaaattcc 


gtcgaccgtt 


aaaaacgtcg 


cccgccccgc 


ccctaacggt 


cgccgctccc 


36372 


gcagccaatc 


accgccccgc 


atccccaaat 


tcaaatacct 


catttgcata 


ttaacgcgca 


36432 


ccaaaagttt 


gaggtatatt 


attgatgatg 








36462 



<210> 2 
<211> 530 
<212> PRT 

<213> chimpanzee adenovirus serotype Pan5 
<400> 2 

Met Met Arg Arg Val Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser 
15 10 15 

Val Met Gin Gin Ala Val Ala Ala Ala Met Gin Pro Pro Leu Glu Ala 
20 25 30 

Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu Gly Arg Asn Ser 
35 40 45 

He Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Arg Leu Tyr 
50 55 60 
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Leu Val Asp Asn Lys Ser Ala Asp He Ala Ser Leu Asn Tyr Gin Asn 
65 70 75 80 

Asp His Ser Asn Phe Leu Thr Thr Val Val Gin Asn Asn Asp Phe Thr 
85 90 95 

Pro Thr Glu Ala Ser Thr Gin Thr He Asn Phe Asp Glu Arg Ser Arg 
100 105 HO 

Trp Gly Gly Gin Leu Lys Thr He Met His Thr Asn Met Pro Asn Val 
115 120 125 

Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg Val Met Val Ser 
130 135 ~ 140 

Arg Lys Thr Pro Asn Gly Val Thr Val Thr Asp Gly Ser Gin Asp Glu 
I 45 150 155 160 

Leu Thr Tyr Glu Trp Val Glu Phe Glu Leu Pro Glu Gly Asn Phe Ser 
165 170 ^ 175 

Val Thr Met Thr He Asp Leu Met Asn Asn Ala He He Asp Asn Tyr 
180 185 190 

Leu Ala Val Gly Arg Gin Asn Gly Val Leu Glu Ser Asp He Gly Val 
195 200 205 

Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro Val Thr Glu 
210 215 220 

Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His Pro Asp He 
225 230 235 240 

Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Glu Ser Arg Leu Ser 
245 250 255 

Asn Leu Leu Gly He Arg Lys Arg Gin Pro Phe Gin Glu Gly Phe Gin 
260 265 270 

He Leu Tyr Glu Asp Leu Glu Gly Gly Asn He Pro Ala Leu Leu Asp 
275 280 285 

Val Asp Ala Tyr Glu Lys Ser Lys Glu Asp Ser Ala Ala Ala Ala Thr 
290 295 300 



Ala Ala Val Ala Thr Ala Ser Thr Glu Val Arg Gly Asp Asn Phe Ala 
305 310 315 * " 320 

Ser Ala Ala Thr Leu Ala Ala Ala Glu Ala Ala Glu Thr Glu Ser Lys 
325 330 335 

He Val He Gin Pro Val Glu Lys Asp Ser Lys Glu Arg Ser Tyr Asn 
340 345 ' 350 
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Val Leu Ala Asp Lys Lys Asn Thr Ala Tyr Arg Ser Trp Tyr Leu Ala 
355 360 365 

Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp Thr Leu Leu 
370 375 380 

Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gin Val Tyr Trp Ser Leu 
385 390 395 400 

Pro Asp Met Met Gin Asp Pro Val Thr Phe Arg Ser Thr Arg Gin Val 
405 410 415 

Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Leu Pro Val Tyr Ser Lys 
420 425 430 

Ser Phe Phe Asn Glu Gin Ala Val Tyr Ser Gin Gin Leu Arg Ala Phe 
435 440 445 

Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn Gin He Leu 
450 455 460 

Val Arg Pro Pro Ala Pro Thr He Thr Thr Val Ser Glu Asn Val Pro 
465 470 475 480 

Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser He Arg Gly 
485 490 " 495 

Val Gin Arg Val Thr Val Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr 
500 505 ~ 510 

Val Tyr Lys Ala Leu Gly Val Val Ala Pro Arg Val Leu Ser Ser Arg 
515 520 525 



Thr Phe 
530 



<210> 3 
<211> 933 
<212> PRT 

<213> chimpanzee adenovirus serotype Pan5 
<400> 3 

Met Ala Thr Pro Ser Met Leu Pro Gin Trp Ala Tyr Met His He Ala 
15 10 15 

Gly Gin Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gin Phe Ala 
20 25 30 

Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro 
35 40 45 

Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gin Arg Leu 
50 55 60 
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Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr 

65 70 75 80 

Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met 

85 90 95 

Ala Ser Thr Tyr Phe Asp He Arg Gly Val Leu Asp Arg Gly Pro Ser 

100 105 110 

Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly 

115 120 125 

Ala Pro Asn Thr Cys Gin Trp Thr Tyr Lys Ala Asp Gly Asp Thr Gly 

130 135 " 140 

Thr Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val Gin Gly He Ser 

145 150 155 ~ 160 

He Thr Lys Asp Gly He Gin Leu Gly Thr Asp Thr Asp Asp Gin Pro 

165 170 175 

He Tyr Ala Asp Lys Thr Tyr Gin Pro Glu Pro Gin Val Gly Asp Ala 



Leu Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Lys 
210 215 220 

Pro Thr Asn Lys Glu Gly Gly Gin Ala Asn Val Lys Thr Glu Thr Gly 
225 230 235 240 

Gly Thr Lys Glu Tyr Asp He Asp Met Ala Phe Phe Asp Asn Arg Ser 
245 250 255 

Ala Ala Ala Ala Gly Leu Ala Pro Glu He Val Leu Tyr Thr Glu Asn 
260 265 270 

Val Asp Leu Glu Thr Pro Asp Thr His He Val Tyr Lys Ala Gly Thr 
275 280 285 

Asp Asp Ser Ser Ser Ser He Asn Leu Gly Gin Gin Ser Met Pro Asn 
290 295 300 

Arg Pro Asn Tyr He Gly Phe Arg Asp Asn Phe He Gly Leu Met Tyr 
305 310 315 320 

Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gin Ala Ser Gin 
325 330 335 

Leu Asn Ala Val Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser Tyr 
340 345 " 350 

Gin Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met 



180 



185 



190 



Glu Trp His Asp He 
195 



Thr Gly Thr Asp Glu Lys Tyr Gly Gly Arg Ala 
200 205 



355 



360 



365 
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Trp Asn Gin Ala Val Asp Ser Tyr Asp Pro Asp Val Arg He He Glu 
370 375 380 

Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp 
385 390 395 400 

Ala Val Gly Arg Thr Asp Thr Tyr Gin Gly He Lys Ala Asn Gly Ala 
405 410 415 

Asp Gin Thr Thr Trp Thr Lys Asp Asp Thr Val Asn Asp Ala Asn Glu 
420 425 430 

Leu Gly Lys Gly Asn Pro Phe Ala Met Glu He Asn He Gin Ala Asn 
435 440 445 

Leu Trp Arg Asn Phe Leu Tyr Ala Asn Val Ala Leu Tyr Leu Pro Asp 
450 455 460 

Ser Tyr Lys Tyr Thr Pro Ala Asn He Thr Leu Pro Thr Asn Thr Asn 
465 470 475 480 

Thr Tyr Asp Tyr Met Asn Gly Arg Val Val Ala Pro Ser Leu Val Asp 
485 490 495 

Ala Tyr He Asn He Gly Ala Arg Trp Ser Leu Asp Pro Met Asp Asn 
500 505 510 

Val Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser 
515 520 * 525 

Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His He Gin Val Pro 
530 535 540 

Gin Lys Phe Phe Ala He Lys Ser Leu Leu Leu Leu Pro Gly Ser Tyr 
545 . 550 555 560 

Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met He Leu Gin Ser 
565 570 575 

Ser Leu Gly Asn Asp Leu Arg Thr Asp Gly Ala Ser He Ala Phe Thr 
580 585 590 

Ser He Asn Leu Tyr Ala Thr Phe Phe Pro Met Ala His Asn Thr Ala 
595 600 605 

Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gin Ser Phe 
610 615 620 

Asn Asp Tyr Leu Ser Ala Ala Asn Met Leu Tyr Pro He Pro Ala Asn 
625 630 635 640 

Ala Thr Asn Val Pro He Ser He Pro Ser Arg Asn Trp Ala Ala Phe 
645 650 ~ 655 

Arg Gly Trp Ser Phe Thr Arg Leu Lys Thr Arg Glu Thr Pro Ser Leu 
660 665 670 
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Gly Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly Ser lie Pro Tyr 
675 680 ^ 685 

Leu Asp Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser He 
690 695 700 

Thr Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr 
705 710 715 720 

Pro Asn Glu Phe Glu He Lys Arg Thr Val Asp Gly Glu Gly Tyr Asn 
725 730 735 

Val Ala Gin Cys Asn Met Thr Lys Asp Trp Phe Leu Val Gin Met Leu 
740 745 750 

Ala His Tyr Asn He Gly Tyr Gin Gly Phe Tyr Val Pro Glu Gly Tyr 
755 760 765 

Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gin Pro Met Ser Arg 
770 775 780 

Gin Val Val Asp Glu Val Asn Tyr Lys Asp Tyr Gin Ala Val Thr Leu 
785 790 795 800 

Ala Tyr Gin His Asn Asn Ser Gly Phe Val Gly Tyr Leu Ala Pro Thr 
805 810 " 815 

Met Arg Gin Gly Gin Pro Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu He 
820 825 ~ ~ 830 

Gly Lys Ser Ala Val Ala Ser Val Thr Gin Lys Lys Phe Leu Cys Asp 
835 840 845 

Arg Val Met Trp Arg He Pro Phe Ser Ser Asn Phe Met Ser Met Gly 
850 855 860 

Ala Leu Thr Asp Leu Gly Gin Asn Met Leu Tyr Ala Asn Ser Ala His 
865 870 875 880 

Ala Leu Asp Met Asn Phe Glu Val Asp Pro Met Asp Glu Ser Thr Leu 
885 890 895 

Leu Tyr Val Val Phe Glu Val Phe Asp Val Val Arg Val His Gin Pro 
900 905 910 

His Arg Gly Val He Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala 
915 920 925 



Gly Asn Ala Thr Thr 
930 



<210> 4 

<211> 445 

<212> PRT 

<213> chimpanzee adenovirus serotype Pan5 
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<400> 4 

Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp Phe Asp Pro Val Tyr 
1 5 10 15 

Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val Pro Phe He Asn Pro Pro 
20 25 30 

Phe Val Ser Ser Asp Gly Phe Gin Glu Lys Pro Leu Gly Val Leu Ser 
35 40 ** 45 

Leu Arg Leu Ala Asp Pro Val Thr Thr Lys Asn Gly Glu He Thr Leu 
50 55 60 

Lys Leu Gly Asp Gly Val Asp Leu Asp Ser Ser Gly Lys Leu He Ser 
65 70 75 80 

Asn Thr Ala Thr Lys Ala Ala Ala Pro Leu Ser Phe Ser Asn Asn Thr 
85 90 95 

He Ser Leu Asn Met Asp Thr Pro Phe Tyr Asn Asn Asn Gly Lys Leu 
100 105 110 

Gly Met Lys Val Thr Ala Pro Leu Lys He Leu Asp Thr Asp Leu Leu 
115 120 125 

Lys Thr Leu Val Val Ala Tyr Gly Gin Gly Leu Gly Thr Asn Thr Thr 
130 135 140 

Gly Ala Leu Val Ala Gin Leu Ala Ser Pro Leu Ala Phe Asp Ser Asn 
145 150 155 160 

Ser Lys He Ala Leu Asn Leu Gly Asn Gly Pro Leu Lys Val Asp Ala 
165 170 " 175 

Asn Arg Leu Asn He Asn Cys Asn Arg Gly Leu Tyr Val Thr Thr Thr 
180 185 " * 190 

Lys Asp Ala Leu Glu Ala Asn He Ser Trp Ala Asn Ala Met Thr Phe 
195 200 205 

He Gly Asn Ala Met Gly Val Asn lie Asp Thr Gin Lys Gly Leu Gin 
210 215 220 

Phe Gly Thr Thr Ser Thr Val Ala Asp Val Lys Asn Ala Tyr Pro He 
225 230 235 240 

Gin lie Lys Leu Gly Ala Gly Leu Thr Phe Asp Ser Thr Gly Ala He 
245 250 255 

Val Ala Trp Asn Lys Asp Asp Asp Lys Leu Thr Leu Trp Thr Thr Ala 
260 265 270 

Asp Pro Ser Pro Asn Cys His He Tyr Ser Glu Lys Asp Ala Lys Leu 
275 280 285 
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Thr Leu Cys Leu Thr Lys Cys Gly Ser Gin He Leu Gly Thr Val Ser 
290 295 300 

Leu He Ala Val Asp Thr Gly Ser Leu Asn Pro He Thr Gly Thr Val 
305 310 315 320 

Thr Thr Ala Leu Val Ser Leu Lys Phe Asp Ala Asn Gly Val Leu Gin 
325 330 ~ 335 

Ser Ser Ser Thr Leu Asp Ser Asp Tyr Trp Asn Phe Arg Gin Gly Asp 
340 345 350 

Val Thr Pro Ala Glu Ala Tyr Thr Asn Ala He Gly Phe Met Pro Asn 
355 360 365 

Leu Lys Ala Tyr Pro Lys Asn Thr Ser Gly Ala Ala Lys Ser His He 
37 0 375 380 

Val Gly Lys Val Tyr Leu His Gly Asp Thr Gly Lys Pro Leu Asp Leu 
385 390 395 400 

He He Thr Phe Asn Glu Thr Ser Asp Glu Ser Cys Thr Tyr Cys He 
405 410 ~ 415 

Asn Phe Gin Trp Gin Trp Gly Ala Asp Gin Tyr Lys Asn Glu Thr Leu 
420 425 430 

Ala Val Ser Ser Phe Thr Phe Ser Tyr He Ala Lys Glu 
435 440 445 

<210> 5 
<211> 36604 
<212> DNA 

<213> chimpanzee adenovirus serotype Pan6 
<220> 

<221> CDS 

<222> (13878) . . (15467) 
<223> L2 Penton 

<220> 

<221> CDS 

<222> (18284) . . (21112) 

<223> L3 Hexon 

<220> 

<221> CDS 

<222> (32162) (33493) 

<223> L5 Fiber 

<400> 5 

catcatcaat aatatacctc aaacttttgg tgcgcgttaa tatgcaaatg agctgtttga 60 
atttggggag ggaggaaggt gattggctgc gggagcggcg accgttaggg gcggggcggg 120 
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tgacgttttg atgacgtggc tatgaggcgg agccggtttg caagttctcg tgggaaaagt 180 

gacgtcaaac gaggtgtggt ttgaacacgg aaatactcaa ttttcccgcg ctctctgaca 240 

ggaaatgagg tgtttctggg cggatgcaag tgaaaacggg ccattttcgc gcgaaaactg 300 

aatgaggaag tgaaaatctg agtaatttcg cgtttatggc agggaggagt atttgccgag 360 

ggccgagtag actttgaccg attacgtggg ggtttcgatt accgtatttt tcacctaaat 420 

ttccgcgtac ggtgtcaaag tccggtgttt ttacgtaggc gtcagctgat cgccagggta 480 

tttaaacctg cgctctctag tcaagaggcc actcttgagt gccagcgagt agagttttct 540 

cctccgcgcc gcgagtcaga tctacacttt gaaagatgag gcacctgaga gacctgcccg 600 

gtaatgtttt cctggctact gggaacgaga ttctggaatt ggtggtggac gccatgatgg 660 

gtgacgaccc tccagagccc cctaccccat ttgaggcgcc ttcgctgtac gatttgtatg 720 

atctggaggt ggatgtgccc gagagcgacc ctaacgagga ggcggtgaat gatttgttta 780 

gcgatgccgc gctgctggct gccgagcagg ctaatacgga ctctggctca gacagcgatt 840 

cctctctcca taccccgaga cccggcagag gtgagaaaaa gatccccgag cttaaagggg 900 

aagagctcga cctgcgctgc tatgaggaat gcttgcctcc gagcgatgat gaggaggacg 960 

aggaggcgat tcgagctgcg gtgaaccagg gagtgaaaac tgcgggcgag agctttagcc 1020 

tggactgtcc tactctgccc ggacacggct gtaagtcttg tgaatttcat cgcatgaata 1080 

ctggagataa gaatgtgatg tgtgccctgt gctatatgag agcttacaac cattgtgttt 1140 

acagtaagtg tgattaactt tagttgggaa ggcagagggt gactgggtgc tgactggttt 1200 

atttatgtat atgttttttt atgtgtaggt cccgtctctg acgtagatga gacccccact 1260 

tcagagtgca tttcatcacc cccagaaatt ggcgaggaac cgcccgaaga tattattcat 1320 

agaccagttg cagtgagagt caccgggcgg agagcagctg tggagagttt ggatgacttg 1380 

ctacagggtg gggatgaacc tttggacttg tgtacccgga aacgccccag gcactaagtg 1440 

ccacacatgt gtgtttactt aaggtgatgt cagtatttat agggtgtgga gtgcaataaa 1500 

atccgtgttg actttaagtg cgtgttttat gactcagggg tggggactgt gggtatataa 1560 

gcaggtgcag acctgtgtgg tcagttcaga gcaggactca tggagatctg gactgtcttg 1620 

gaagactttc accagactag acagttgcta gagaactcat cggagggagt ctcttacctg 1680 

tggagattct gcttcggtgg gcctctagct aagctagtct atagggccaa acaggattat 1740 

aaggaacaat ttgaggatat tttgagagag tgtcctggta tttttgactc tctcaacttg 1800 

ggccatcagt ctcactttaa ccagagtatt ctgagagccc ttgacttttc tactcctggc 1860 
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agaactaccg ccgcggtagc cttttttgcc tttattcttg acaaatggag tcaagaaacc 1920 

catttcagca gggattaccg tctggactgc ttagcagtag ctttgtggag aacatggagg 1980 

tgccagcgcc tgaatgcaat ctccggctac ttgccagtac agccggtaga cacgctgagg 2040 

atcctgagtc tccagtcacc ccaggaacac caacgccgcc agcagccgca gcaggagcag 2100 

cagcaagagg aggaccgaga agagaacccg agagccggtc tggaccctcc ggtggcggag 2160 

gaggaggagt agctgacttg tttcccgagc tgcgccgggt gctgactagg tcttccagtg 2220 

gacgggagag ggggattaag cgggagaggc atgaggagac tagccacaga actgaactga 2280 

ctgtcagtct gatgagccgc aggcgcccag aatcggtgtg gtggcatgag gtgcagtcgc 2340 

aggggataga tgaggtctcg gtgatgcatg agaaatattc cctagaacaa gtcaagactt 2400 

gttggttgga gcccgaggat gattgggagg tagccatcag gaattatgcc aagctggctc 2460 

tgaagccaga caagaagtac aagattacca aactgattaa tatcagaaat tcctgctaca 2520 

tttcagggaa tggggccgag gtggagatca gtacccagga gagggtggcc ttcagatgtt 2580 

gtatgatgaa tatgtacccg ggggtggtgg gcatggaggg agtcaccttt atgaacacga 2640 

ggttcagggg tgatgggtat aatggggtgg tctttatggc caacaccaag ctgacagtgc 2700 

acggatgctc cttctttggc ttcaataaca tgtgcatcga ggcctggggc agtgtttcag 2760 

tgaggggatg cagcttttca gccaactgga tgggggtcgt gggcagaacc aagagcaagg 2820 

tgtcagtgaa gaaatgcctg ttcgagaggt gccacctggg ggtgatgagc gagggcgaag 2880 

ccaaagtcaa acactgcgcc tctaccgaga cgggctgctt tgtgctgatc aagggcaatg 2940 

cccaagtcaa gcataacatg atctgtgggg cctcggatga gcgcggctac cagatgctga 3000 

cctgcgccgg tgggaacagc catatgctgg ccaccgtgca tgtggcctcg cacccccgca 3060 

agacatggcc cgagttcgag cacaacgtca tgacccgctg caatgtgcac ctgggctccc 3120 

gccgaggcat gttcatgccc taccagtgca acatgcaatt tgtgaaggtg ctgctggagc 3180 

ccgatgccat gtccagagtg agcctgacgg gggtgtttga catgaatgtg gagctgtgga 3240 

aaattctgag atatgatgaa tccaagacca ggtgccgggc ctgcgaatgc ggaggcaagc 3300 

acgccaggct tcagcccgtg tgtgtggagg tgacggagga cctgcgaccc gatcatttgg 3360 

tgttgtcctg caacgggacg gagttcggct ccagcgggga agaatctgac tagagtgagt 3420 

agtgtttggg gctgggtgtg agcctgcatg aggggcagaa tgactaaaat ctgtggtttt 3480 

ctgtgtgttg cagcagcatg agcggaagcg cctcctttga gggaggggta ttcagccctt 3540 
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atctgacggg gcgtctcccc tcctgggcgg gagtgcgtca gaatgtgatg ggatccacgg 3600 

tggacggccg gcccgtgcag cccgcgaact cttcaaccct gacctacgcg accctgagct 3660 

cctcgtccgt ggacgcagct gccgccgcag ctgctgcttc cgccgccagc gccgtgcgcg 3720 

gaatggccct gggcgccggc tactacagct ctctggtggc caactcgagt tccaccaata 3780 

atcccgccag cctgaacgag gagaagctgc tgctgctgat ggcccagctc gaggccctga 3840 

cccagcgcct gggcgagctg acccagcagg tggctcagct gcaggcggag acgcgggccg 3900 

cggttgccac ggtgaaaacc aaataaaaaa tgaatcaata aataaacgga gacggttgtt 3960 

gattttaaca cagagtcttg aatctttatt tgatttttcg cgcgcggtag gccctggacc 4020 

accggtctcg atcattgagc acccggtgga tcttttccag gacccggtag aggtgggctt 4080 

ggatgttgag gtacatgggc atgagcccgt cccgggggtg gaggtagctc cattgcaggg 4140 

cctcgtgctc ggggatggtg ttgtaaatca cccagtcata gcaggggcgc agggcgtggt 4200 

gctgcacgat gtccttgagg aggagactga tggccacggg cagccccttg gtgtaggtgt 4260 

tgacgaacct gttgagctgg gagggatgca tgcgggggga gatgagatgc atcttggcct 4320 

ggatcttgag attggcgatg ttcccgccca gatcccgccg ggggttcatg ttgtgcagga 4380 

ccaccagcac ggtgtatccg gtgcacttgg ggaatttgtc atgcaacttg gaagggaagg 4440 

cgtgaaagaa tttggagacg cccttgtgac cgcccaggtt ttccatgcac tcatccatga 4500 

tgatggcgat gggcccgtgg gcggcggcct gggcaaagac gtttcggggg tcggacacat 4560 

cgtagttgtg gtcctgggtg agctcgtcat aggccatttt aatgaatttg gggcggaggg 4620 

tgcccgactg ggggacgaag gtgccctcga tcccgggggc gtagttgccc tcgcagatct 4680 

gcatctccca ggccttgagc tcggaggggg ggatcatgtc cacctgcggg gcgatgaaaa 4740 

aaacggtttc cggggcgggg gagatgagct gggccgaaag caggttccgg agcagctggg 4800 

acttgccgca accggtgggg ccgtagatga ccccgatgac cggctgcagg tggtagttga 4860 

gggagagaca gctgccgtcc tcgcggagga ggggggccac ctcgttcatc atctcgcgca 4920 

catgcatgtt ctcgcgcacg agttccgcca ggaggcgctc gccccccagc gagaggagct 4980 

cttgcagcga ggcgaagttt ttcagcggct tgagtccgtc ggccatgggc attttggaga 5040 

gggtctgttg caagagttcc agacggtccc agagctcggt gatgtgctct agggcatctc 5100 

gatccagcag acctcctcgt ttcgcgggtt ggggcgactg cgggagtagg gcaccaggcg 5160 

atgggcgtcc agcgaggcca gggtccggtc cttccagggc cgcagggtcc gcgtcagcgt 5220 

ggtctccgtc acggtgaagg ggtgcgcgcc gggctgggcg cttgcgaggg tgcgcttcag 5280 



38 



WO 03/046124 



PCT/US02/33645 



gctcatccgg ctggtcgaga accgctcccg gtcggcgccc tgcgcgtcgg ccaggtagca 5340 
attgagcatg agttcgtagt tgagcgcctc ggccgcgtgg cccttggcgc ggagcttacc 5400 
tttggaagtg tgtccgcaga cgggacagag gagggacttg agggcgtaga gcttgggggc 5460 
gaggaagacg gactcggggg cgtaggcgtc cgcgccgcag ctggcgcaga cggtctcgca 5520 
ctccacgagc caggtgaggt cggggcggtt ggggtcaaaa acgaggtttc ctccgtgctt 5580 
tttgatgcgt ttcttacctc tggtctccat gagctcgtgt ccccgctggg tgacaaagag 5640 
gctgtccgtg tccccgtaga ccgactttat gggccggtcc tcgagcgggg tgccgcggtc 5700 
ctcgtcgtag aggaaccccg cccactccga gacgaaggcc cgggtccagg ccagcacgaa 5760 
ggaggccacg tgggaggggt agcggtcgtt gtccaccagc gggtccacct tctccagggt 5820 
atgcaagcac atgtccccct cgtccacatc caggaaggtg attggcttgt aagtgtaggc 58 80 
cacgtgaccg ggggtcccgg ccgggggggt ataaaagggg gcgggcccct gctcgtcctc 5940 
actgtcttcc ggatcgctgt ccaggagcgc cagctgttgg ggtaggtatt ccctctcgaa 6000 
ggcgggcatg acctcggcac tcaggttgtc agtttctaga aacgaggagg atttgatatt 6060 
gacggtgccg ttggagacgc ctttcatgag cccctcgtcc atttggtcag aaaagacgat 6120 

ctttttgttg tcgagcttgg tggcgaagga gccgtagagg gcgttggaga gcagcttggc 6180 

gatggagcgc atggtctggt tcttttcctt gtcggcgcgc tccttggcgg cgatgttgag 6240 

ctgcacgtac tcgcgcgcca cgcacttcca ttcggggaag acggtggtga gctcgtcggg 6300 

cacgattctg acccgccagc cgcggttgtg cagggtgatg aggtccacgc tggtggccac 6360 

ctcgccgcgc aggggctcgt tggtccagca gaggcgcccg cccttgcgcg agcagaaggg 6420 

gggcagcggg tccagcatga gctcgtcggg ggggtcggcg tccacggtga agatgccggg 6480 

caggagctcg gggtcgaagt agctgatgca ggtgcccaga ttgtccagcg ccgcttgcca 6540 

gtcgcgcacg gccagcgcgc gctcgtaggg gctgaggggc gtgccccagg gcatggggtg 6600 

cgtgagcgcg gaggcgtaca tgccgcagat gtcgtagacg tagaggggct cctcgaggac 6660 

gccgatgtag gtggggtagc agcgcccccc gcggatgctg gcgcgcacgt agtcgtacag 6720 

ctcgtgcgag ggcgcgagga gccccgtgcc gaggttggag cgttgcggct tttcggcgcg 6780 

gtagacgatc tggcggaaga tggcgtggga gttggaggag atggtgggcc tttggaagat 6840 

gttgaagtgg gcgtggggca ggccgaccga gtccctgatg aagtgggcgt aggagtcctg 6900 

cagcttggcg acgagctcgg cggtgacgag gacgtccagg gcgcagtagt cgagggtctc 6960 
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ttggatgatg 
ctcttcgcgg 
gcccaccatg 
ggcgtaagct 
catgaccttg 
ttggaagtcc 
gaggatcttg 
ggcccggttg 
cccgacgatg 
ctcgtcgtag 
gacgtggggg 
gcggtcccgg 
gaaggtgcgg 
gagctcgacg 
cttgccgaag 
ggtgcgagga 
gctgttgatg 
atacaagcgt 
ctgggttcct 
ctgtactacg 
gagcccgcgc 
ggcgcgcagg 
cggcggcgcg 
cttgatctcc 
gggcgccacc 
cggcgaggac 
ggcacgtcgg 
gcgacgacgc 
gtgagtttga 



tcatacttga 
tccttccagt 
tagaactggt 
tgcgcggcct 
aggaactggt 
gtgcgcttct 
cccgcgcggg 
ttgatgacct 
tagagttcca 
gtgagctcgg 
ttggcgctga 
tactgacgga 
gggtcgccgt 
agcggcgggt 
gaccccatcc 
tgcgagccga 
tgatggaagt 
ccgcagtgct 
ttggcgagga 
tcttggccat 
gggaggcagg 
ccggagctgt 

°ggttgactt 

acggcgccgt 
accgtgcccc 
gcgcgccggg 
cgccgcgcgc 
gacggttgac 
acctgaaaga 



gctggccctt 
actcttcgag 
tgacggcctt 
tgcgcaggga 
gcttgaagtc 
tgtaggcggg 
gcatgaagtt 
gggcggcgag 
cgaatcgcgg 
cggggtcgct 
ggaaggaagt 
actgttggcc 
gccagcggtc 
ccccggagag 
aggtgtaggt 
tggggaagaa 
agaaatgccg 
cgcaacgctg 
atttcagtgg 
cggcgtggcc 
tccagacctc 
ccagggtcct 
gcaggagctt 
tggtggctac 
gtttcttctt 
cggcaggggc 
gggcaggttc 
gtcctggatc 
gagttcgaca 



ctgcttccac 
ggggaacccg 
gtaggcgcag 
ggtgtgggtg 
gaggtcgtcg 
gttaggcaaa 
gcgagtgatg 
gacgatctcg 
gcggcccttg 
gagcccgtgc 
ccagagatcc 
cacggccatt 
ccacttgagc 
tttcatgacc 
ttccacatcg 
ctggatctcc 
acggcgcgcc 
cacgggatgc 
gcagtggagc 
atcgtctgcc 
ggctcggacg 
gagacgctgc 
ttccagggcg 
gtccacggct 
gggcgctgct 
ggctcggggc 
tggtactgcg 
tgacgcctct 
gaatcaatct 



agctcgcggt 
tcctgatcgg 
cagcccttct 
agggcgaagg 
cagccgccct 
gcgaaagtaa 
cggaaaggct 
tcgaagccgt 
acgtggggca 
tgctcgaggg 
acggccaggg 
ttttcggggg 
tggagggcga 
agcatgaagg 
taggtgagga 
tgccaccagt 
gagcactcgt 
acgtgctgca 
gctggcggct 
tcgatggtgg 
ggtcggagag 
ggagtcaggt 
cgcgggaggt 
tgcagggtgc 
tccatgtcgg 
ccggaggcag 
cccggagaag 
gggtgaaggc 
cggtatcgtt 



tgagaaggaa 
cacggtaa'ga 
ccacggggag 
tgtcgcgcac 
gctcccagag 
catcgttgaa 
ggggcacctc 
tgatgttgtg 
gcttcttgag 
cccagtcggc 
cggtctgcaa 
tgacgcagta 
ggtcgtgggc 
ggacgagctg 
agagcctttc 
tggaggaatg 
gcttgtgttt 
cgagctgtac 
gcatctcgtg 
tcatgctgac 
cgaggacgag 
cagtgggcag 
ccagatggta 
cgtgcccctg 
tcagaagcgg 
gggcggcagg 
actggcgtga 
cacgggaccc 
gacggcggcc 



7020 

7080 

7140 

7200 

7260 
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7380 
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7500 

7560 
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7680 
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8040 

8100 

8160 

8220 

8280 
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8400 
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8700 
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tgccgcagga tctcttgcac gtcgcccgag ttgtcctggt aggcgatctc ggtcatgaac 8760 

tgctcgatct cctcctcctg aaggtctccg cggccggcgc gctcgacggt ggccgcgagg 8820 

tcgttggaga tgcggcccat gagctgcgag aaggcgttca tgccggcctc gttccagacg 8880 

cggctgtaga ccacggctcc gtcggggtcg cgcgcgcgca tgaccacctg ggcgaggttg 8940 

agctcgacgt ggcgcgtgaa gaccgcgtag ttgcagaggc gctggtagag gtagttgagc 9000 

gtggtggcga tgtgctcggt gacgaagaag tacatgatcc agcggcggag cggcatctcg 9060 

ctgacgtcgc ccagggcttc caagcgttcc atggcctcgt agaagtccac ggcgaagttg 9120 

aaaaactggg agttgcgcgc cgagacggtc aactcctcct ccagaagacg gatgagctcg 9180 

gcgatggtgg cgcgcacctc gcgctcgaag gccccggggg gctcctcttc catctcctcc 9240 

tcttcctcct ccactaacat ctcttctact tcctcctcag gaggcggtgg cgggggaggg 9300 

gccctgcgtc gccggcggcg cacgggcaga cggtcgatga agcgctcgat ggtctccccg 9360 

cgccggcgac gcatggtctc ggtgacggcg cgcccgtcct cgcggggccg cagcatgaag 9420 

acgccgccgc gcatctccag gtggccgccg ggggggtctc cgttgggcag ggagagggcg 9480 

ctgacgatgc atcttatcaa ttgacccgta gggactccgc gcaaggacct gagcgtctcg 9540 

agatccacgg gatccgaaaa ccgctgaacg aaggcttcga gccagtcgca gtcgcaaggt 9600 

aggctgagcc cggtttcttg ttcttcgggt atttggtcgg gaggcgggcg ggcgatgctg 9660 

ctggtgatga agttgaagta ggcggtcctg agacggcgga tggtggcgag gagcaccagg 9720 

tccttgggcc cggcttgctg gatgcgcaga cggtcggcca tgccccaggc gtggtcctga 9780 

cacctggcga ggtccttgta gtagtcctgc atgagccgct ccacgggcac ctcctcctcg 9840 

cccgcgcggc cgtgcatgcg cgtgagcccg aacccgcgct gcggctggac gagcgccagg 9900 

tcggcgacga cgcgctcggt gaggatggcc tgctggatct gggtgagggt ggtctggaag 9960 

tcgtcgaagt cgacgaagcg gtggtaggct ccggtgttga tggtgtagga gcagttggcc 10020 

atgacggacc agttgacggt ctggtggccg ggtcgcacga gctcgtggta cttgaggcgc 10080 

gagtaggcgc gcgtgtcgaa gatgtagtcg ttgcaggcgc gcacgaggta ctggtatccg 1014 0 

acgaggaagt gcggcggcgg ctggcggtag agcggccatc gctcggtggc gggggcgccg 10200 

ggcgcgaggt cctcgagcat gaggcggtgg tagccgtaga tgtacctgga catccaggtg 10260 

atgccggcgg cggtggtgga ggcgcgcggg aactcgcgga cgcggttcca gatgttgcgc 10320 

agcggcagga agtagttcat ggtggccgcg gtctggcccg tgaggcgcgc gcagtcgtgg 10380 
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atgctctaga catacgggca aaaacgaaag cggtcagcgg ctcgactccg tggcctggag 10440 
gctaagcgaa cgggttgggc tgcgcgtgta ccccggttcg aatctcgaat caggctggag 10500 
ccgcagctaa cgtggtactg gcactcccgt ctcgacccaa gcctgctaac gaaacctcca 10560 
ggatacggag gcgggtcgtt ttttggcctt ggtcgctggt catgaaaaac tagtaagcgc 10620 
ggaaagcggc cgcccgcgat ggctcgctgc cgtagtctgg agaaagaatc gccagggttg 10680 
cgttgcggtg tgccccggtt cgagcctcag cgctcggcgc cggccggatt ccgcggctaa 10740 
cgtgggcgtg gctgccccgt cgtttccaag accccttagc cagccgactt ctccagttac 10800 
ggagcgagcc cctctttttt tttcttgtgt ttttgccaga tgcatcccgt actgcggcag 10860 
atgcgccccc accctccacc acaaccgccc ctaccgcagc agcagcaaca gccggcgctt 10920 
ctgcccccgc cccagcagca gccagccact accgcggcgg ccgccgtgag cggagccggc 10980 
gttcagtatg acctggcctt ggaagagggc gaggggctgg cgcggctggg ggcgtcgtcg 11040 
ccggagcggc acccgcgcgt gcagatgaaa agggacgctc gcgaggccta cgtgcccaag 11100 
cagaacctgt tcagagacag gagcggcgag gagcccgagg agatgcgcgc ctcccgcttc 11160 
cacgcggggc gggagctgcg gcgcggcctg gaccgaaagc gggtgctgag ggacgaggat 11220 
ttcgaggcgg acgagctgac ggggatcagc cccgcgcgcg cgcacgtggc cgcggccaac 11280 
ctggtcacgg cgtacgagca gaccgtgaag gaggagagca acttccaaaa atccttcaac 11340 
aaccacgtgc gcacgctgat cgcgcgcgag gaggtgaccc tgggcctgat gcacctgtgg 11400 
gacctgctgg aggccatcgt gcagaacccc acgagcaagc cgctgacggc gcagctgttt 11460 
ctggtggtgc agcacagtcg ggacaacgag acgttcaggg aggcgctgct gaatatcacc 11520 
gagcccgagg gccgctggct cctggacctg gtgaacattt tgcagagcat cgtggtgcag 11580 
gagcgcgggc tgccgctgtc cgagaagctg gcggccatca acttctcggt gctgagtctg 11640 
ggcaagtact acgctaggaa gatctacaag accccgtacg tgcccataga caaggaggtg 11700 
aagatcgacg ggttttacat gcgcatgacc ctgaaagtgc tgaccctgag cgacgatctg 11760 
ggggtgtacc gcaacgacag gatgcaccgc gcggtgagcg ccagccgccg gcgcgagctg 11820 
agcgaccagg agctgatgca cagcctgcag cgggccctga ccggggccgg gaccgagggg 11880 
gagagctact ttgacatggg cgcggacctg cgctggcagc ccagccgccg ggccttggaa 11940 
gctgccggcg gttcccccta cgtggaggag gtggacgatg aggaggagga gggcgagtac 12000 
ctggaagact gatggcgcga ccgtattttt gctagatgca gcaacagcca ccgccgccgc 12060 
ctcctgatcc cgcgatgcgg gcggcgctgc agagccagcc gtccggcatt aactcctcgg 12120 
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acgattggac ccaggccatg caacgcatca tggcgctgac gacccgcaat cccgaagcct 12180 

ttagacagca gcctcaggcc aaccggctct cggccatcct ggaggccgtg gtgccctcgc 12240 

gctcgaaccc cacgcacgag aaggtgctgg ccatcgtgaa cgcgctggtg gagaacaagg 12300 

ccatccgcgg tgacgaggcc gggctggtgt acaacgcgct gctggagcgc gtggcccgct 12360 

acaacagcac caacgtgcag acgaacctgg accgcatggt gaccgacgtg cgcgaggcgg 12420 

tgtcgcagcg cgagcggttc caccgcgagt cgaacctggg ctccatggtg gcgctgaacg 12480 

ccttcctgag cacgcagccc gccaacgtgc cccggggcca ggaggactac accaacttca 12540 

tcagcgcgct gcggctgatg gtggccgagg tgccccagag cgaggtgtac cagtcggggc 12600 

cggactactt cttccagacc agtcgccagg gcttgcagac cgtgaacctg agccaggctt 12660 

tcaagaactt gcagggactg tggggcgtgc aggccccggt cggggaccgc gcgacggtgt 12720 

cgagcctgct gacgccgaac tcgcgcctgc tgctgctgct ggtggcgccc ttcacggaca 12780 

gcggcagcgt gagccgcgac tcgtacctgg gctacctgct taacctgtac cgcgaggcca 12840 

tcggacaggc gcacgtggac gagcagacct accaggagat cacccacgtg agccgcgcgc 12900 

tgggccagga ggacccgggc aacctggagg ccaccctgaa cttcctgctg accaaccggt 12960 

cgcagaagat cccgccccag tacgcgctga gcaccgagga ggagcgcatc ctgcgctacg 13020 

tgcagcagag cgtggggctg ttcctgatgc aggagggggc cacgcccagc gcggcgctcg 13080 

acatgaccgc gcgcaacatg gagcccagca tgtacgcccg caaccgcccg ttcatcaata 13140 

agctgatgga ctacttgcat cgggcggccg ccatgaactc ggactacttt accaacgcca 13200 

tcttgaaccc gcactggctc ccgccgcccg ggttctacac gggcgagtac gacatgcccg 13260 

accccaacga cgggttcctg tgggacgacg tggacagcag cgtgttctcg ccgcgtccag 13320 

gaaccaatgc cgtgtggaag aaagagggcg gggaccggcg gccgtcctcg gcgctgtccg 13380 

gtcgcgcggg tgctgccgcg gcggtgcccg aggccgccag ccccttcccg agcctgccct 13440 

tttcgctgaa cagcgtgcgc agcagcgagc tgggtcggct gacgcgaccg cgcctgctgg 13500 

gcgaggagga gtacctgaac gactccttgt tgaggcccga gcgcgagaag aacttcccca 13560 

ataacgggat agagagcctg gtggacaaga tgagccgctg gaagacgtac gcgcacgagc 13620 

acagggacga gccccgagct agcagcgcag gcacccgtag acgccagcgg cacgacaggc 13680 

agcggggact ggtgtgggac gatgaggatt ccgccgacga cagcagcgtg ttggacttgg 13740 

gtgggagtgg tggtaacccg ttcgctcacc tgcgcccccg tatcgggcgc ctgatgtaag 13800 
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aatctgaaaa aataaaagac ggtactcacc aaggccatgg cgaccagcgt gcgttcttct 13860 

ctgttgtttg tagtagt atg atg agg cgc gtg tac ccg gag ggt cct cct 13910 

Met Met Arg Arg Val Tyr Pro Glu Gly Pro Pro 
15 10 

ccc teg tac gag age gtg atg cag cag gcg gtg gcg gcg gcg atg cag 13958 
Pro Ser Tyr Glu Ser Val Met Gin Gin Ala Val Ala Ala Ala Met Gin 
15 20 25 

ccc ccg ctg gag gcg cct tac gtg ccc ccg egg tac ctg gcg cct acg 14006 
Pro Pro Leu Glu Ala Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr 
30 35 40 

gag ggg egg aac age att cgt tac teg gag ctg gca ccc ttg tac gat 14054 
Glu Gly Arg Asn Ser lie Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp 
45 50 55 

acc ace egg ttg tac ctg gtg gac aac aag teg gca gac ate gee teg 14102 
Thr Thr Arg Leu Tyr Leu Val Asp Asn Lys Ser Ala Asp He Ala Ser 
60 65 70 75 

ctg aac tac cag aac gac cac age aac ttc ctg acc acc gtg gtg cag 14150 
Leu Asn Tyr Gin Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gin 
80 85 90 

aac aac gat ttc acc ccc acg gag gec age acc cag acc ate aac ttt 14198 
Asn Asn Asp Phe Thr Pro Thr Glu Ala Ser Thr Gin Thr He Asn Phe 
95 100 105 

gac gag cgc teg egg tgg ggc ggc cag ctg aaa acc ate atg cac acc 14246 
Asp Glu Arg Ser Arg Trp Gly Gly Gin Leu Lys Thr He Met His Thr 
110 115 120 

aac atg ccc aac gtg aac gag ttc atg tac age aac aag ttc aag gcg 14294 
Asn Met Pro Asn Val Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala 
125 130 135 

egg gtg atg gtc teg cgc aag acc ccc aac ggg gtg gat gat gat tat 14342 
Arg Val Met Val Ser Arg Lys Thr Pro Asn Gly Val Asp Asp Asp Tyr 
140 145 150 155 

gat ggt agt cag gac gag ctg acc tac gag tgg gtg gag ttt gag ctg 14390 
Asp Gly Ser Gin Asp Glu Leu Thr Tyr Glu Trp Val Glu Phe Glu Leu 
160 165 170 

ccc gag ggc aac ttc teg gtg acc atg acc ate gat ctg atg aac aac 14438 
Pro Glu Gly Asn Phe Ser Val Thr Met Thr He Asp Leu Met Asn Asn 
175 180 185 

gec ate ate gac aac tac ttg gcg gtg ggg egg cag aac ggg gtg ctg 14486 
Ala He lie Asp Asn Tyr Leu Ala Val Gly Arg Gin Asn Gly Val Leu 
190 195 200 

gag age gac ate ggc gtg aag ttc gac acg cgc aac ttc egg ctg ggc 14534 
Glu Ser Asp He Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly 
205 210 215 
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tgg gac ccc gtg acc gag ctg gtg atg ccg ggc gtg tac acc aac gag 14582 
Trp Asp Pro Val Thr Glu Leu Val Met Pro Gly Val Tyr Thr Asn Glu 
220 225 230 " 235 

gcc ttc cac ccc gac ate gtc ctg ctg ccc ggc tgc ggc gtg gac ttc 14630 
Ala Phe His Pro Asp He Val Leu Leu Pro Gly Cys Gly Val Asp Phe 
240 245 250 

acc gag age cgc etc age aac ctg ctg ggc ate cgc aag egg cag ccc 14678 
Thr Glu Ser Arg Leu Ser Asn Leu Leu Gly He Arg Lys Arg Gin Pro 
255 260 265 

lit ?? C lt C C f 9 ° tg taC gag gaC Ctg gag 99 c aac 14726 

Phe Gin Glu Gly Phe Gin He Leu Tyr Glu Asp Leu Glu Gly Gly Asn 

270 275 280 

ate ccc gcg etc ttg gat gtc gaa gcc tac gag aaa age aag gag gat 14774 
He Pro Ala Leu Leu Asp Val Glu Ala Tyr Glu Lys Ser Lys Glu Asp 
285 290 " 295 

age acc gcc gcg gcg acc gca gcc gtg gcc acc gcc tct acc gag gtg 14822 
Ser Thr Ala Ala Ala Thr Ala Ala Val Ala Thr Ala Ser Thr Glu Val 
300 305 310 315 

egg ggc gat aat ttt get age get gcg gca gcg gcc gag gcg get gaa 14870 
Arg Gly Asp Asn Phe Ala Ser Ala Ala Ala Ala Ala Glu Ala Ala Glu 
320 325 330 



acc gaa agt aag ata gtc ate cag ccg gtg gag aag gac age aag gac 
Thr Glu Ser Lys He Val He Gin Pro Val Glu Lys Asp Ser Lys Asp 
335 340 345 

agg age tac aac gtg etc gcg gac aag aaa aac acc gcc tac cgc age 
Arg Ser Tyr Asn Val Leu Ala Asp Lys Lys Asn Thr Ala Tyr Arg Ser 
350 355 3 6 o 

tgg tac ctg gcc tac aac tac ggc gac ccc gag aag ggc gtg cgc tec 
Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser 



365 370 



375 



tgg acg ctg etc acc acc teg gac gtc acc tgc ggc gtg gag caa gtc 
Trp Thr Leu Leu Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gin Val 
380 385 



590 395 



14918 



14966 



15014 



15062 



15110 



tac tgg teg ctg ccc gac atg atg caa gac ccg gtc acc ttc cgc tec 
Tyr Trp Ser Leu Pro Asp Met Met Gin Asp Pro Val Thr Phe Arg Ser 
400 405 4io 

acg cgt caa gtt age aac tac ccg gtg gtg ggc gcc gag etc ctg ccc 15158 
Thr Arg Gin Val Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Leu Pro 
415 420 425 

gtc tac tec aag age ttc ttc aac gag cag gcc gtc tac teg cag cag 15206 
Val Tyr Ser Lys Ser Phe Phe Asn Glu Gin Ala Val Tyr Ser Gin Gin 
430 435 440 
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ctg cgc gcc ttc acc teg etc acg cac gtc ttc aac cgc ttc ccc gag 15254 
Leu Arg Ala Phe Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu 
445 450 455 

aac cag ate etc gtc cgc ccg ccc gcg ccc acc att acc acc gtc agt 15302 
Asn Gin He Leu Val Arg Pro Pro Ala Pro Thr He Thr Thr Val Ser 
460 465 470 475 

gaa aac gtt cct get etc aca gat cac ggg acc ctg ccg ctg cgc age 15350 
Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser 
480 485 490 

agt ate egg gga gtc cag cgc gtg acc gtc act gac gcc aga cgc cgc 15398 
Ser He Arg Gly Val Gin Arg Val Thr Val Thr Asp Ala Arg Arg Arg 
495 500 505 

acc tgc ccc tac gtc tac aag gcc ctg ggc gta gtc gcg ccg cgc gtc 15446 
Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Val Val Ala Pro Arg Val 
510 515 520 

etc teg age cgc acc ttc taa aaaatgtcca ttctcatctc geccagtaat 15497 
Leu Ser Ser Arg Thr Phe 
525 



aacaceggtt 


qgcrcrcctaca 


coccjcccacf c 






f» /->» -> /-» t-t (-* 4- i-* i—i 
L-L-aaLtJ LLUL. 


Ijjj / 


acgcaacacc 


ccgtgcgcgt 


gccrcggacac 


ttccgcgctc 


cctcrCTcrcrecre 

W W W \J \£ \£ ^ *S 
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cgcgtgcgct 


cgcgcaccac 


cgtcgacgac 


gtgatcgacc 


aggtggtggc 


cgacgcgcgc 


15677 


aactacacgc 


ccgccgccgc 


gcccgtctcc 


accgtggacg 


ccgtcatcga 


cagcgtggtg 


15737 


gccgacgcgc 


gccggtacgc 


ccgcaccaag 


agccggcggc 


ggcgcatcgc 


ccggcggcac 


15797 


cggagcaccc 


ccgccatgcg 


cgcggcgcga 


gccttgctgc 


geagggecag 


gcgcacggga 


15857 


cgcagggcca 


tgetcaggge 


ggccagacgc 


gcggcctccg 


gcagcagcag 


cgccggcagg 


15917 


acccgcagac 


gcgcggccac 


ggcggcggcg 


gcggccatcg 


ccagcatgtc 


ccgcccgcgg 


15977 


cgcggcaacg 


tgtactgggt 


gcgcgacgcc 


gccaccggtg 


tgcgcgtgcc 


cgtgcgcacc 


16037 


cgcccccctc 


gcacttgaag 


atgetgaett 


cgcgatgttg 


atgtgtccca 


gcggcgagga 


16097 


ggatgtccaa 


gegcaaatae 


aaggaagaga 


tgctccaggt 


catcgcgcct 


gagatctacg 


16157 


gccccgcggc 


ggcggtgaag 


gaggaaagaa 


agccccgcaa 


actgaagegg 


gtcaaaaagg 


16217 


acaaaaagga 


ggaggaagat 


gaeggactgg 


tggagtttgt 


gcgcgagttc 


gccccccggc 


16277 


ggcgcgtgca 


gtggcgcggg 


cggaaagtga 


aaceggtget 


gcggcccggc 


accacggtgg 


16337 


tcttcacgcc 


eggegagegt 


tccggctccg 


cctccaagcg 


ctcctacgac 


gaggtgtacg 


16397 


gggacgagga 


catcctcgag 


caggeggteg 


agcgtctggg 


cgagtttgcg 


taeggcaage 


16457 


gcagccgccc 


cgcgcccttg 


aaagaggagg 


cggtgtccat 


cccgctggac 


cacggcaacc 


16517 
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ccacgccgag cctgaagccg gtgaccctgc agcaggtgct accgagcgcg gcgccgcgcc 16577 
ggggcttcaa gcgcgagggc ggcgaggatc tgtacccgac catgcagctg atggtgccca 16637 
agcgccagaa gctggaggac gtgctggagc acatgaaggt ggaccccgag gtgcagcccg 16697 
aggtcaaggt gcggcccatc aagcaggtgg ccccgggcct gggcgtgcag accgtggaca 16757 
tcaagatccc cacggagccc atggaaacgc agaccgagcc cgtgaagccc agcaccagca 16817 
ccatggaggt gcagacggat ccctggatgc cagcaccagc ttccaccagc actcgccgaa 16877 
gacgcaagta cggcgcggcc agcctgctga tgcccaacta cgcgctgcat ccttccatca 16937 
tccccacgcc gggctaccgc ggcacgcgct tctaccgcgg ctacaccagc agccgccgcc 16997 
gcaagaccac cacccgccgc cgtcgtcgca gccgccgcag cagcaccgcg acttccgcct 17057 
tggtgcggag agtgtatcgc agcgggcgcg agcctctgac cctgccgcgc gcgcgctacc 17117 
acccgagcat cgccatttaa ctaccgcctc ctacttgcag atatggccct cacatgccgc 17177 
ctccgcgtcc ccattacggg ctaccgagga agaaagccgc gccgtagaag gctgacgggg 17237 
aacgggctgc gtcgccatca ccaccggcgg cggcgcgcca tcagcaagcg gttgggggga 17297 
ggcttcctgc ccgcgctgat ccccatcatc gccgcggcga tcggggcgat ccccggcata 17357 
gcttccgtgg cggtgcaggc ctctcagcgc cactgagaca caaaaaagca tggatttgta 17417 
ataaaaaaaa aaatggactg acgctcctgg tcctgtgatg tgtgttttta gatggaagac 17477 
atcaattttt cgtccctggc accgcgacac ggcacgcggc cgtttatggg cacctggagc 17537 
gacatcggca acagccaact gaacgggggc gccttcaatt ggagcagtct ctggagcggg 17597 
cttaagaatt tcgggtccac gctcaaaacc tatggcaaca aggcgtggaa cagcagcaca 17657 
gggcaggcgc tgagggaaaa gctgaaagaa cagaacttcc agcagaaggt ggttgatggc 17717 
ctggcctcag gcatcaacgg ggtggttgac ctggccaacc aggccgtgca gaaacagatc 17777 
aacagccgcc tggacgcggt cccgcccgcg gggtccgtgg agatgcccca ggtggaggag 17837 
gagctgcctc ccctggacaa gcgcggcgac aagcgaccgc gtcccgacgc ggaggagacg 17897 
ctgctgacgc acacggacga gccgcccccg tacgaggagg cggtgaaact gggcctgccc 17957 
accacgcggc ccgtggcgcc tctggccacc ggagtgctga aacccagcag cagccagccc 18017 
gcgaccctgg acttgcctcc gcctcgcccc tccacagtgg ctaagcccct gccgccggtg 18077 
gccgtcgcgt cgcgcgcccc ccgaggccgc ccccaggcga actggcagag cactctgaac 18137 
agcatcgtgg gtctgggagt gcagagtgtg aagcgccgcc gctgctatta aaagacactg 18197 
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tagcgcttaa cttgcttgtc tgtgtgtata tgtatgtccg ccgaccagaa ggaggagtgt 18257 

gaagaggcgc gtcgccgagt tgcaag atg gcc acc cca teg atg ctg ccc cag 18310 

Met Ala Thr Pro Ser Met Leu Pro Gin 
530 535 

tgg gcg tac atg cac ate gcc gga cag gac get teg gag tac ctg agt 18358 
Trp Ala Tyr Met His He Ala Gly Gin Asp Ala Ser Glu Tyr Leu Ser 
540 545 550 

ccg ggt ctg gtg cag ttc gcc cgc gcc aca gac acc tac ttc agt ctg 18406 
Pro Gly Leu Val Gin Phe Ala Arg Ala Thr Asp Thr Tyr Phe Ser Leu 
555 560 565 570 

ggg aac aag ttt agg aac ccc acg gtg gcg ccc acg cac gat gtg acc 18454 
Gly Asn Lys Phe Arg Asn Pro Thr Val Ala Pro Thr His Asp Val Thr 
575 580 585 



acc gac cgc age cag egg ctg acg ctg cgc ttc gtg ccc gtg gac cgc 
Thr Asp Arg Ser Gin Arg Leu Thr Leu Arg Phe Val Pro Val Asp Arg 
590 595 600 

gag gac aac acc tac teg tac aaa gtg cgc tac acg ctg gcc gtg ggc 
Glu Asp Asn Thr Tyr Ser Tyr Lys Val Arg Tyr Thr Leu Ala Val Gly 
605 610 615 



act gac gtt aca gcg aat cag aat aaa cca att tat gcc gac aaa aca 
Thr Asp Val Thr Ala Asn Gin Asn Lys Pro He Tyr Ala Asp Lys Thr 
700 705 710 



18502 



18550 



gac aac cgc gtg ctg gac atg gcc age acc tac ttt gac ate cgc ggc 18598 
Asp Asn Arg Val Leu Asp Met Ala Ser Thr Tyr Phe Asp He Arg Gly 
620 625 630 

gtg ctg gac egg ggc cct age ttc aaa ccc tac tct ggc acc gcc tac 18646 
Val Leu Asp Arg Gly Pro Ser Phe Lys Pro Tyr Ser Gly Thr Ala Tyr 
635 640 645 " 650 

aac age eta get ccc aag gga get ccc aat tec age cag tgg gag caa 18694 
Asn Ser Leu Ala Pro Lys Gly Ala Pro Asn Ser Ser Gin Trp Glu Gin 
655 660 665 

gca aaa aca ggc aat ggg gga act atg gaa aca cac aca tat ggt gtg 18742 
Ala Lys Thr Gly Asn Gly Gly Thr Met Glu Thr His Thr Tyr Gly Val 
670 675 680 

gcc cca atg ggc gga gag aat att aca aaa gat ggt ctt caa att gga 18790 
Ala Pro Met Gly Gly Glu Asn He Thr Lys Asp Gly Leu Gin He Gly 
685 690 695 



18838 



ttt caa cca gaa ccg caa gta gga gaa gaa aat tgg caa gaa act gaa 18886 

Phe Gin Pro Glu Pro Gin Val Gly Glu Glu Asn Trp Gin Glu Thr Glu 

715 720 725 * 730 

aac ttt tat ggc ggt aga get ctt aaa aaa gac aca aac atg aaa cct 18934 

Asn Phe Tyr Gly Gly Arg Ala Leu Lys Lys Asp Thr Asn Met Lys Pro 

735 740 745 
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tgc tat ggc tec tat get aga ccc ace aat gaa aaa gga ggt caa get 18982 
Cys Tyr Gly Ser Tyr Ala Arg Pro Thr Asn Glu Lys Gly Gly Gin Ala 
750 755 - ' 760 

aaa ctt aaa gtt gga gat gat gga gtt cca aec aaa gaa ttc gac ata 19030 
Lys Leu Lys Val Gly Asp Asp Gly Val Pro Thr Lys Glu Phe Asp lie 
765 770 775 

gac ctg get ttc ttt gat act ccc ggt ggc acc gtg aac ggt caa gac 19078 
Asp Leu Ala Phe Phe Asp Thr Pro Gly Gly Thr Val Asn Gly Gin Asp 
780 785 790 

gag tat aaa gca gac att gtc atg tat acc gaa aac acg tat ttg gaa 19126 
Glu Tyr Lys Ala Asp He Val Met Tyr Thr Glu Asn Thr Tyr Leu Glu 
795 800 805 " 810 

act cca gac acg cat gtg gta tac aaa cca ggc aag gat gat gca agt 19174 
Thr Pro Asp Thr His Val Val Tyr Lys Pro Gly Lys Asp Asp Ala Ser 
815 820 825 

tct gaa att aac ctg gtt cag cag tct atg ccc aac aga ccc aac tac 19222 
Ser Glu He Asn Leu Val Gin Gin Ser Met Pro Asn Arg Pro Asn Tyr 
830 835 840 

att ggg ttc agg gac aac ttt ate ggt ctt atg tac tac aac age act 19270 
He Gly Phe Arg Asp Asn Phe He Gly Leu Met Tyr Tyr Asn Ser Thr 
845 850 855 

ggc aat atg ggt gtg ctt get ggt cag gec tec cag ctg aat get gtg 19318 
Gly Asn Met Gly Val Leu Ala Gly Gin Ala Ser Gin Leu Asn Ala Val 
860 865 870 

gtt gat ttg caa gac aga aac acc gag ctg tec tac cag etc ttg ctt 19366 
Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin Leu Leu Leu 
875 880 885 890 

gac tct ttg ggt gac aga acc egg tat ttc agt atg tgg aac cag gcg 19414 
Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gin Ala 
895 900 905 

gtg gac agt tat gac ccc gat gtg cgc ate ate gaa aac cat ggt gtg 19462 
Val Asp Ser Tyr Asp Pro Asp Val Arg He He Glu Asn His Gly Val 
910 915 920 

gag gat gaa ttg cca aac tat tgc ttc ccc ttg gac ggc tct ggc act 19510 
Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp Gly Ser Gly Thr 
925 930 935 

aac gee gca tac caa ggt gtg aaa gta aaa gat ggt caa gat ggt gat 19558 
Asn Ala Ala Tyr Gin Gly Val Lys Val Lys Asp Gly Gin Asp Gly Asp 
940 945 950 

gtt gag agt gaa tgg gaa aat gac gat act gtt gca get cga aat caa 19606 
Val Glu Ser Glu Trp Glu Asn Asp Asp Thr Val Ala Ala Arg Asn Gin 
955 960 965 970 
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tta tgt aaa ggt aac att ttc gcc atg gag att aat etc cag get aac 
Leu Cys Lys Gly Asn He Phe Ala Met Glu He Asn Leu Gin Ala Asn 
975 980 985 

ctg tgg aga agt ttc etc tac teg aac gtg gcc ctg tac ctg ccc gac 
Leu Trp Arg Ser Phe Leu Tyr Ser Asn Val Ala Leu Tyr Leu Pro Asp 
990 995 1000 

tec tac aag tac acg ccg ace aac gtc acg ctg ccg acc aac acc 
Ser Tyr Lys Tyr Thr Pro Thr Asn Val Thr Leu Pro Thr Asn Thr 
1005 1010 1015 

aac acc tac gat tac atg aat ggc aga gtg aca cct ccc teg ctg 
Asn Thr Tyr Asp Tyr Met Asn Gly Arg Val Thr Pro Pro Ser Leu 
1020 1025 1030 

gta gac gcc tac etc aac ate ggg gcg cgc tgg teg ctg gac ccc 
Val Asp Ala Tyr Leu Asn He Gly Ala Arg Trp Ser Leu Asp Pro 
1035 1040 1045 

atg gac aac gtc aac ccc ttc aac cac cac cgc aac gcg ggc ctg 
Met Asp Asn Val Asn Pro Phe Asn His His Arg Asn Ala Gly Leu 
1050 1055 1060 

cgc tac cgc tec atg etc ctg ggc aac ggg cgc tac gtg ccc ttc 
Arg Tyr Arg Ser Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe 
1065 1070 1075 

cac ate cag gtg ccc caa aag ttt ttc gcc ate aag age etc ctg 
His He Gin Val Pro Gin Lys Phe Phe Ala lie Lys Ser Leu Leu 
1080 1085 1090 

etc ctg ccc ggg tec tac acc tac gag tgg aac ttc cgc aag gac 
Leu Leu Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp 
1095 1100 H05 

gtc aac atg ate ctg cag age tec eta ggc aac gac ctg cgc acg 
Val Asn Met He Leu Gin Ser Ser Leu Gly Asn Asp Leu Arg Thr 
1110 1115 H20 

gac ggg gcc tec ate gcc ttc acc age ate aac etc tac gcc acc 
Asp Gly Ala Ser lie Ala Phe Thr Ser He Asn Leu Tyr Ala Thr 
1125 H30 H35 

ttc ttc ccc atg gcg cac aac acc gcc tec acg etc gag gcc atg 
Phe Phe Pro Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met 
H40 H45 H50 

ctg cgc aac gac acc aac gac cag tec ttc aac gac tac etc teg 
Leu Arg Asn Asp Thr Asn Asp Gin Ser Phe Asn Asp Tyr Leu Ser 
II 55 1160 H65 

gcg gcc aac atg etc tac ccc ate ccg gcc aac gcc acc aac gtg 
Ala Ala Asn Met Leu Tyr Pro lie Pro Ala Asn Ala Thr Asn Val 
H70 H75 H80 



19654 



19702 



19747 



19792 



19837 



19882 



19927 



19972 



20017 



20062 



20107 



20152 



20197 



20242 
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ccc ate tec ate ccc teg cgc aac tgg gee gee ttc cgc gga tgg 
Pro lie Ser lie Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp 
1185 1190 * 1195 



20287 



tec ttc acg cgc ctg aag acc cgc 
Ser Phe Thr Arg Leu Lys Thr Arg 
1200 1205 



gag acg ccc teg etc ggc tec 
Glu Thr Pro Ser Leu Gly Ser 
1210 



ggg ttc gac ccc tac ttc gtc tac teg ggc tec ate ccc tac eta 
Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly Ser lie Pro Tyr Leu 
1215 1220 1225 



20332 



20377 



gac ggc acc ttc tac etc aac cac acc ttc aag aag gtc tec ate 
Asp Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser He 
1230 1235 1240 



20422 



acc ttc gac tec tec gtc age tgg 
Thr Phe Asp Ser Ser Val Ser Trp 
1245 1250 



ccc ggc aac gac cgc etc ctg 
Pro Gly Asn Asp Arg Leu Leu 
1255 



20467 



acg ccc aac gag ttc gaa ate aag cgc acc gtc gac gga gag gga 
Thr Pro Asn Glu Phe Glu He Lys Arg Thr Val Asp Gly Glu Gly 
1260 1265 " 1270 



20512 



tac aac gtg gec cag tgc aac atg acc aag gac tgg ttc ctg gtc 
Tyr Asn Val Ala Gin Cys Asn Met Thr Lys Asp Trp Phe Leu Val 
1275 1280 * 1285 



20557 



cag atg ctg gee cac tac aac ate ggc tac cag ggc ttc tac gtg 
Gin Met Leu Ala His Tyr Asn He Gly Tyr Gin Gly Phe Tyr Val 
1290 1295 1300 



20602 



ccc gag ggc tac aag gac cgc atg tac tec ttc ttc cgc aac ttc 
Pro Glu Gly Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe 
1305 1310 1315 



20647 



cag ccc atg age cgc cag gtc gtg gac gag gtc aac tac aag gac 
Gin Pro Met Ser Arg Gin Val Val Asp Glu Val Asn Tyr Lys Asp 
1320 1325 1330 



20692 



tac cag gee gtc acc ctg gee tac cag cac aac aac teg ggc ttc 
Tyr Gin Ala Val Thr Leu Ala Tyr Gin His Asn Asn Ser Gly Phe 
1335 1340 1345 



20737 



gtc ggc tac etc gcg ccc acc atg cgc cag ggc cag ccc tac ccc 
Val Gly Tyr Leu Ala Pro Thr Met Arg Gin Gly Gin Pro Tyr Pro 
1350 1355 1360 ~ 



20782 



gee aac tac ccc tac ccg etc ate ggc aag age gee gtc gee age 
Ala Asn Tyr Pro Tyr Pro Leu He Gly Lys Ser Ala Val Ala Ser 
1365 1370 " 1375 



20827 



gtc acc cag 
Val Thr Gin 
1380 



aaa aag ttc etc tgc gac egg gtc atg tgg cgc ate 
Lys Lys Phe Leu Cys Asp Arg Val Met Trp Arg He 
1385 1390 



20872 
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ccc ttc tec age aac ttc atg tec atg ggc gcg etc ace gac etc 20917 
Pro Phe Ser Ser Asn Phe Met Ser Met Gly Ala Leu Thr Asp Leu 
1395 1400 1405 

ggc cag aac atg etc tac gee aac tec gee cac gcg eta gac atg 20962 
Gly Gin Asn Met Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met 
1410 1415 1420 

aat ttc gaa gtc gac ccc atg gat gag tec acc ctt etc tat gtt 21007 
Asn Phe Glu Val Asp Pro Met Asp Glu Ser Thr Leu Leu Tyr Val 
1425 1430 1435 

gtc ttc gaa gtc ttc gac gtc gtc cga gtg cac cag ccc cac cgc 21052 
Val Phe Glu Val Phe Asp Val Val Arg Val His Gin Pro His Arg 
1440 1445 1450 

ggc gtc ate gaa gee gtc tac ctg cgc acg ccc ttc teg gee ggc 21097 
Gly Val He Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly 
1455 1460 1465 

aac gee acc acc taa gccgctcttg ettcttgeaa gatgaeggeg ggctccggcg 21152 
Asn Ala Thr Thr 
1470 

agcaggagct cagggccatc ctccgcgacc tgggctgegg gccctgcttc ctgggcacct 21212 

tegacaageg cttccctgga ttcatggccc cgcacaagct ggcctgcgcc ategtgaaca 21272 

cggccggccg egagaceggg ggegagcact ggctggcctt cgcctggaac .ccgcgctccc 21332 

acacatgeta cctcttcgac cccttcgggt teteggaega gcgcctcaag cagatctacc 21392 

agttcgagta egagggectg ctgcgtcgca gcgccctggc caccgaggac cgctgcgtca 21452 

ccctggaaaa gtccacccag acegtgeagg gtccgcgctc ggccgcctgc gggctcttct 21512 

gctgcatgtt cctgcacgcc ttcgtgcact ggcccgaccg ccccatggac aagaacccca 21572 

ccatgaactt actgaegggg gtgcccaacg gcatgctcca gtcgccccag gtggaaccca 21632 

ccctgcgccg caaccaggaa gcgctctacc gcttcctcaa tgcccactcc gcctactttc 21692 

gctcccaccg cgcgcgcatc gagaaggeca ccgccttcga ccgcatgaat caagacatgt 21752 

aaaaaacegg tgtgtgtatg tgaatgcttt attcataata aacagcacat gtttatgeca 21812 

ccttctctga ggctctgact ttatttagaa atcgaagggg ttctgccggc teteggcatg 21872 

gcccgcgggc agggatacgt tgcggaactg gtacttgggc agecacttga acteggggat 21932 

cagcagcttg ggcacgggga ggtcggggaa cgagtcgctc cacagcttgc gcgtgagttg 21992 

cagggcgccc ageaggtegg gegeggagat cttgaaatcg cagttgggac ccgcgttctg 22052 

cgegegagag ttgeggtaca eggggttgea gcactggaac accatcaggg ccgggtgctt 22112 

cacgcttgcc agcaccgtcg cgtcggtgat gccctccacg tccagatcct cggcgttggc 22172 
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catcccgaag ggggtcatct tgcaggtctg ccgccccatg ctgggcacgc agccgggctt 22232 

gtggttgcaa tcgcagtgca gggggatcag catcatctgg gcctgctcgg agctcatgcc 22292 

cgggtacatg gccttcatga aagcctccag ctggcggaag gcctgctgcg ccttgccgcc 22352 

ctcggtgaag aagaccccgc aggacttgct agagaactgg ttggtggcgc agccggcgtc 22412 

gtgcacgcag cagcgcgcgt cgttgttggc cagctgcacc acgctgcgcc cccagcggtt 22472 

ctgggtgatc ttggcccggt tggggttctc cttcagcgcg cgctgcccgt tctcgctcgc 22532 

cacatccatc tcgatagtgt gctccttctg gatcatcacg gtcccgtgca ggcaccgcag 22592 

cttgccctcg gcttcggtgc agccgtgcag ccacagcgcg cagccggtgc actcccagtt 22652 

cttgtgggcg atctgggagt gcgagtgcac gaagccctgc aggaagcggc ccatcatcgc 22712 

ggtcagggtc ttgttgctgg tgaaggtcag cgggatgccg cggtgctcct cgttcacata 22772 

caggtggcag atgcggcggt acacctcgcc ctgctcgggc atcagctgga aggcggactt 22832 

caggtcgctc tccacgcggt accggtccat cagcagcgtc atcacttcca tgcccttctc 22892 

ccaggccgaa acgatcggca ggctcagggg gttcttcacc gccattgtca tcttagtcgc 22952 

cgccgccgag gtcagggggt cgttctcgtc cagggtctca aacactcgct tgccgtcctt 23012 

ctcgatgatg cgcacggggg gaaagctgaa gcccacggcc gccagctcct cctcggcctg 23072 

cctttcgtcc tcgctgtcct ggctgatgtc ttgcaaaggc acatgcttgg tcttgcgggg 23132 . 

tttctttttg ggcggcagag gcggcggcga tgtgctggga gagcgcgagt tctcgttcac 23192 

cacgactatt tcttcttctt ggccgtcgtc cgagaccacg cggcggtagg catgcctctt 23252 

ctggggcaga ggcggaggcg acgggctctc gcggttcggc gggcggctgg cagagccccf 23312 

tccgcgttcg ggggtgcgct cctggcggcg ctgctctgac tgacttcctc cgcggccggc 23372 

cattgtgttc tcctagggag caacaacaag catggagact cagccatcgt cgccaacatc 23432 

gccatctgcc cccgccgcca ccgccgacga gaaccagcag cagaatgaaa gcttaaccgc 23492 

cccgccgccc agccccacct ccgacgccgc ggccccagac atgcaagaga tggaggaatc 23552 

catcgagatt gacctgggct acgtgacgcc cgcggagcac gaggaggagc tggcagcgcg 23612 

cttttcagcc ccggaagaga accaccaaga gcagccagag caggaagcag agaacgagca 23672 

gaaccaggct gggcacgagc atggcgacta cctgagcggg gcagaggacg tgctcatcaa 23732 

gcatctggcc cgccaatgca tcatcgtcaa ggacgcgctg ctcgaccgcg ccgaggtgcc 23792 

cctcagcgtg gcggagctca gccgcgccta cgagcgcaac ctcttctcgc cgcgcgtgcc 23852 
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ccccaagcgc cagcccaacg gcacctgtga gcccaacccg cgcctcaact tctacccggt 23912 
cttcgcggtg cccgaggccc tggccaccta ccacctcttt ttcaagaacc aaaggatccc 23972 
cgtctcctgc cgcgccaacc gcacccgcgc cgacgccctg ctcaacctgg gccccggcgc 24032 
ccgcctacct gatatcacct ccttggaaga ggttcccaag atcttcgagg gtctgggcag 24092 
cgacgagact cgggccgcga acgctctgca aggaagcgga gaggagcatg agcaccacag 24152 
cgccctggtg gagttggaag gcgacaacgc gcgcctggcg gtcctcaagc gcacggtcga 24212 
gctgacccac ttcgcctacc cggcgctcaa cctgcccccc aaggtcatga gcgccgtcat 24272 
ggaccaggtg ctcatcaagc gcgcctcgcc cctctcggag gaggagatgc aggaccccga 24332 
gagttcggac gagggcaagc ccgtggtcag cgacgagcag ctggcgcgct ggctgggagc 24392 
gagtagcacc ccccagagcc tggaagagcg gcgcaagctc atgatggccg tggtcctggt 24452 
gaccgtggag ctggagtgtc tgcgccgctt ctttgccgac gcggagaccc tgcgcaaggt 24512 
cgaggagaac ctgcactacc tcttcaggca cgggttcgtg cgccaggcct gcaagatctc 24572 
caacgtggag ctgaccaacc tggtctccta catgggcatc ctgcacgaga accgcctggg 24 632 
gcaaaacgtg ctgcacacca ccctgcgcgg ggaggcccgc cgcgactaca tccgcgactg 24692 
cgtctacctg tacctctgcc acacctggca gacgggcatg ggcgtgtggc agcagtgcct 24752 
ggaggagcag aacctgaaag agctctgcaa gctcctgcag aagaacctca aggccctgtg 24812 
gaccgggttc gacgagcgta ccaccgcctc ggacctggcc gacctcatct tccccgagcg 24872 
cctgcggctg acgctgcgca acgggctgcc cgactttatg agccaaagca tgttgcaaaa 24932 
ctttcgctct ttcatcctcg aacgctccgg gatcctgccc gccacctgct ccgcgctgcc 24992 
ctcggacttc gtgccgctga ccttccgcga gtgccccccg ccgctctgga gccactgcta 25052 
cttgctgcgc ctggccaact acctggccta ccactcggac gtgatcgagg acgtcagcgg 25112 
cgagggtctg ctggagtgcc actgccgctg caacctctgc acgccgcacc gctccctggc 25172 
ctgcaacccc cagctgctga gcgagaccca gatcatcggc accttcgagt tgcaaggccc 25232 
cggcgacggc gagggcaagg ggggtctgaa actcaccccg gggctgtgga cctcggccta 25292 
cttgcgcaag ttcgtgcccg aggactacca tcccttcgag atcaggttct acgaggacca 25352 
atcccagccg cccaaggccg agctgtcggc ctgcgtcatc acccaggggg ccatcctggc 25412 
ccaattgcaa gccatccaga aatcccgcca agaatttctg ctgaaaaagg gccacggggt 25472 
ctacttggac ccccagaccg gagaggagct caaccccagc ttcccccagg atgccccgag 25532 
gaagcagcaa gaagctgaaa gtggagctgc cgccgccgga ggatttggag gaagactggg 25592 
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agagcagtca ggcagaggag gaggagatgg aagactggga cagcactcag gcagaggagg 25652 

acagcctgca agacagtctg gaggaggaag acgaggtgga ggaggcagag gaagaagcag 25712 

ccgccgccag accgtcgtcc tcggcggaga aagcaagcag cacggatacc atctccgctc 25772 

cgggtcgggg tcgcggcggc cgggcccaca gtaggtggga cgagaccggg cgcttcccga 25832 

accccaccac ccagaccggt aagaaggagc ggcagggata caagtcctgg cgggggcaca 25892 

aaaacgccat cgtctcctgc ttgcaagcct gcgggggcaa catctccttc acccggcgct 25952 

acctgctctt ccaccgcggg gtgaacttcc cccgcaacat cttgcattac taccgtcacc 26012 

tccacagccc ctactactgt ttccaagaag aggcagaaac ccagcagcag cagaaaacca 26072 

gcggcagcag cagctagaaa atccacagcg gcggcaggtg gactgaggat cgcggcgaac 26132 

gagccggcgc agacccggga gctgaggaac cggatctttc ccaccctcta tgccatcttc 26192 

cagcagagtc gggggcagga gcaggaactg aaagtcaaga accgttctct gcgctcgctc 26252 

acccgcagtt gtctgtatca caagagcgaa gaccaacttc agcgcactct cgaggacgcc 26312 

gaggctctct tcaacaagta ctgcgcgctc actcttaaag agtagcccgc gcccgcccac 26372 

acacggaaaa aggcgggaat tacgtcacca cctgcgccct tcgcccgacc atcatgagca 26432 

aagagattcc cacgccttac atgtggagct accagcccca gatgggcctg gccgccggcg 26492 

ccgcccagga ctactccacc cgcatgaact ggctcagtgc cgggcccgcg atgatctcac 26552 

gggtgaatga catccgcgcc caccgaaacc agatactcct agaacagtca gcgatcaccg 26612 

ccacgccccg ccatcacctt aatccgcgta attggcccgc cgccctggtg taccaggaaa 26672 

ttccccagcc cacgaccgta ctacttccgc gagacgccca ggccgaagtc cagctgacta 26732 

actcaggtgt ccagctggcc ggcggcgccg ccctgtgtcg tcaccgcccc gctcagggta 2 6792 

taaagcggct ggtgatccga ggcagaggca cacagctcaa cgacgaggtg gtgagctctt 26852 

cgctgggtct gcgacctgac ggagtcttcc aactcgccgg atcggggaga tcttccttca 26912 

cgcctcgtca ggccgtcctg actttggaga gttcgtcctc gcagccccgc tcgggcggca 26972 

tcggcactct ccagttcgtg gaggagttca ctccctcggt ctacttcaac cccttctccg 27032 

gctcccccgg ccactacccg gacgagttca tcccgaactt cgacgccatc agcgagtcgg 27092 

tggacggcta cgattgaatg tcccatggtg gcgcagctga cctagctcgg cttcgacacc 27152 

tggaccactg ccgccgcttc cgctgcttcg ctcgggatct cgccgagttt gcctactttg 27212 

agctgcccga ggagcaccct cagggcccag cccacggagt gcggatcatc gtcgaagggg 27272 
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gcctcgactc ccacctgctt cggatcttca gccagcgacc gatcctggtc gagcgcgaac 27332 
aaggacagac ccttcttact ttgtactgca tctgcaacca ccccggcctg catgaaagtc 27392 
tttgttgtct gctgtgtact gagtataata aaagctgaga tcagcgacta ctccggactc 27452 
gattgtggtg ttcctgctat caaccggtcc ctgttcttca ccgggaacga gaccgagctc 27512 
cagctccagt gtaagcccca caagaagtac ctcacctggc tgttccaggg ctccccgatc 27572 
gccgttgtca accactgcga caacgacgga gtcctgctga gcggccctgc caaccttact 27632 
ttttccaccc gcagaagcaa gctccagctc ttccaaccct tcctccccgg gacctatcag 27692 
tgcgtctcag gaccctgcca tcacaccttc cacctgatcc cgaataccac agcgccgctc 27752 
cccgctacta acaaccaaac tacccaccaa cgccaccgtc gcgacctttc ctctgaatct 27812 
aataccacta ccggaggtga gctccgaggt cgaccaacct ctgggattta ctacggcccc 27872 
tgggaggtgg tggggttaat agcgctaggc ctagttgcgg gtgggctttt ggttctctgc 27932 
tacctatacc tcccttgctg ttcgtactta gtggtgctgt gttgctggtt taagaaatgg 27992 
ggaagatcac cctagtgagc tgcggtgcgc tggtggcggt gttgctttcg attgtgggac 28 052 
tgggcggcgc ggctgtagtg aaggagaagg ccgatccctg cttgcatttc aatcccaaca 28112 
aatgccagct gagttttcag cccgatggca atcggtgcgc ggtactgatc aagtgcggat 28172 
gggaatgcga gaacgtgaga atcgagtaca ataacaagac tcggaacaat actctcgcgt 28232 
ccgtgtggca gcccggggac cccgagtggt acaccgtctc tgtccccggt gctgacggct 28292 
ccccgcgcac cgtgaataat actttcattt ttgcgcacat gtgcaacacg gtcatgtgga 28352 
tgagcaagca gtacgatatg tggcccccca cgaaggagaa catcgtggtc ttctccatcg 28412 
cttacagcct gtgcacggcg ctaatcaccg ctatcgtgtg cctgagcatt cacatgctca 28472 
tcgctattcg ccccagaaat aatgccgaga aagagaaaca gccataacac gttttttcac 28532 
acaccttgtt tttacagaca atgcgtctgt taaatttttt aaacattgtg ctcagtattg 28592 
cttatgcctc tggttatgca aacatacaga aaacccttta tgtaggatct gatggtacac 28652 
tagagggtac ccaatcacaa gccaaggttg catggtattt "ttatagaacc aacactgatc 28712 
cagttaaact ttgtaagggt gaattgccgc gtacacataa aactccactt acatttagtt 28772 
gcagcaataa taatcttaca cttttttcaa ttacaaaaca atatactggt acttattaca 28832 
gtacaaactt tcatacagga caagataaat attatactgt taaggtagaa aatcctacca 28892 
ctcctagaac taccaccacc accactactg caaagcccac tgtgaaaact acaactagga 28952 
ccaccacaac tacagaaacc accaccagca caacacttgc tgcaactaca cacacacaca 29012 
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ctaagctaac 


cttacagacc 


actaatgatt 


tgatcgccct 


gctgcaaaag 


ggggataaca 


29072 


gcaccacttc 


caatgaggag 


atacccaaat 


ccatgattgg 


cattattgtt 


gctgtagtgg 


29132 


tgtgcatgtt 


gatcatcgcc 


ttgtgcatgg 


tgtactatgc 


cttctgctac 


agaaagcaca 


29192 


gactgaacga 


caagctggaa 


cacttactaa 


gtgttgaatt 


ttaatttttt 


agaaccatga 


29252 


agatcctagg 


cctttttagt 


ttttctatca 


ttacctctgc 


tctttgtgaa 


tcagtggata 


29312 


gagatgttac 


tattaccact 


ggttctaatt 


atacactgaa 


agggccaccc 


tcaggtatgc 


29372 


tttcgtggta 


ttgctatttt 


ggaactgaca 


ctgatcaaac 


tgaattatgc 


aattttcaaa 


29432 


aaggcaaaac 


ctcaaactct 


aaaatctcta 


attatcaatg 


caatggcact 


gatctgatac 


29492 


tactcaatgt 


cacgaaagca 


tatggtggca 


gttattattg 


ccctggacaa 


aacactgaag 


29552 


aaatgatttt 


ttacaaagtg 


gaagtggttg 


atcccactac 


accacccacc 


accacaacta 


29612 


ttcataccac 


acacacagaa 


caaacaccag 


aggcaacaga 


agcagagttg 


gccttccagg 


29672 


ttcacggaga 


ttcctttgct 


gtcaataccc 


ctacacccga 


tcagcggtgt 


ccggggccgc 


29732 


tagtcagcgg 


cattgtcggt 


gtgctttcgg 


gattagcagt 


cataatcatc 


tgcatgttca 


29792 


tttttgcttg 


ctgctataga 


aggctttacc 


gacaaaaatc 


agacccactg 


ctgaacctct 


29852 


atgtttaatt 


ttttccagag 


ccatgaaggc 


agttagcgct 


ctagtttttt 


gttctttgat 


29912 


tggcattgtt 


tttaatagta 


aaattaccag 


agttagcttt 


attaaacatg 


ttaatgtaac 


29972 


tgaaggagat 


aacatcacac 


tagcaggtgt 


agaaggtgct 


caaaacacca 


cctggacaaa 


30032 


ataccatcta 


ggatggagag 


atatttgcac 


ctggaatgta 


acttattatt 


gcataggagt 


30092 


taatcttacc 


attgttaacg 


ctaaccaatc 


tcagaatggg 


ttaattaaag 


gacagagtgt 


30152 


tagtgtgacc 


agtgatgggt 


actataccca 


gcatagtttt 


aactacaaca 


ttactgtcat 


30212 


accactgcct 


acgcctagcc 


cacctagcac 


taccacacag 


acaaccacat 


acagtacatc 


30272 


aaatcagcct 


accaccacta 


cagcagcaga 


ggttgccagc 


tcgtctgggg 


tccgagtggc 


30332 


atttttgatg 


ttggccccat 


ctagcagtcc 


cactgctagt 


accaatgagc 


agactactga 


30392 


atttttgtcc 


actgtcgaga 


gccacaccac 


agctacctcc 


agtgccttct 


ctagcaccgc 


30452 


caatctctcc 


tcgctttcct 


ctacaccaat 


cagccccgct 


actactccta 


gccccgctcc 


30512 


tcttcccact 


cccctgaagc 


aaacagacgg 


cggcatgcaa 


tggcagatca 


ccctgctcat 


30572 


tgtgatcggg 


ttggtcatcc 


tggccgtgtt 


gctctactac 


atcttctgcc 


gccgcattcc 


30632 


caacgcgcac 


cgcaagccgg 


cctacaagcc 


catcgttatc 


gggcagccgg 


agccgcttca 


30692 
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ggtggaaggg ggtctaagga atcttctctt ctcttttaca gtatggtgat tgaactatga 30752 

ttcctagaca attcttgatc actattctta tctgcctcct ccaagtctgt gccaccctcg 30812 

ctctggtggc caacgccagt ccagactgta ttgggccctt cgcctcctac gtgctctttg 30872 

ccttcgtcac ctgcatctgc tgctgtagca tagtctgcct gcttatcacc ttcttccagt 30932 

tcattgactg gatctttgtg cgcatcgcct acctgcgcca ccacccccag taccgcgacc 30992 

agcgagtggc gcagctgctc aggctcctct gataagcatg cgggctctgc tacttctcgc 31052 

gcttctgctg ttagtgctcc cccgtcccgt cgacccccgg tcccccactc agtcccccga 31112 

ggaggttcgc aaatgcaaat tccaagaacc ctggaaattc ctcaaatgct accgccaaaa 31172 

atcagacatg catcccagct ggatcatgat cattgggatc gtgaacattc tggcctgcac 31232 

cctcatctcc tttgtgattt acccctgctt tgactttggt tggaactcgc cagaggcgct 31292 

ctatctcccg cctgaacctg acacaccacc acagcagcaa cctcaggcac acgcactacc 31352 

accaccacag cctaggccac aatacatgcc catattagac tatgaggccg agccacagcg 31412 

acccatgctc cccgctatta gttacttcaa tctaaccggc ggagatgact gacccactgg 31472 

ccaataacaa cgtcaacgac cttctcctgg acatggacgg ccgcgcctcg gagcagcgac 31532 

tcgcccaact tcgcattcgt cagcagcagg agagagccgt caaggagctg caggacggca 31592 

tagccatcca ccagtgcaag agaggcatct tctgcctggt gaaacaggcc aagatctcct 31652 

acgaggtcac ccagaccgac catcgcctct cctacgagct cctgcagcag cgccagaagt 31712 

tcacctgcct ggtcggagtc aaccccatcg tcatcaccca gcagtcgggc gataccaagg 31772 

ggtgcatcca ctgctcctgc gactcccccg actgcgtcca cactctgatc aagaccctct 31832 

gcggcctccg cgacctcctc cccatgaact aatcaccccc ttatccagtg aaataaagat 31892 

catattgatg atgatttaaa taaaaaaaat aatcatttga tttgaaataa agatacaatc 31952 

atattgatga tttgagttta acaaaaataa agaatcactt acttgaaatc tgataccagg 32012 

tctctgtcca tgttttctgc caacaccacc tcactcccct cttcccagct ctggtactgc 32072 

aggccccggc gggctgcaaa cttcctccac acgctgaagg ggatgtcaaa ttcctcctgt 32132 

ccctcaatct tcattttatc ttctatcag atg tec aaa aag cgc gtc egg gtg 32185 

Met Ser Lys Lys Arg Val Arg Val 
1475 

gat gat gac ttc gac ccc gtc tac ccc tac gat gca gac aac gca 32230 
Asp Asp Asp Phe Asp Pro Val Tyr Pro Tyr Asp Ala Asp Asn Ala 
1480 1485 1490 
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ccg acc gtg ccc ttc ate aac ccc ccc ttc gtc tct tea gat gga 

Pro Thr Val Pro Phe He Asn Pro Pro Phe Val Ser Ser Asp Gly 

1495 1500 1505 y 

ttc caa gag aag ccc ctg ggg gtg ttg tec ctg cga ctg get gac 

Phe Gin Glu Lys Pro Leu Gly Val Leu Ser Leu Arg Leu Ala Asp 

1510 1515 1520 

ccc gtc acc acc aag aac ggg gaa ate acc etc aag ctg gga gag 

Pro Val Thr Thr Lys Asn Gly Glu lie Thr Leu Lys Leu Gly Glu 

1525 1530 1535 

ggg gtg gac etc gac teg teg gga aaa etc ate tec aac acg gcc 

Gly Val Asp Leu Asp Ser Ser Gly Lys Leu He Ser Asn Thr Ala 

I 540 1545 1550 

acc aag gcc gcc gcc cct etc agt att tea aac aac acc att tec 

Thr Lys Ala Ala Ala Pro Leu Ser He Ser Asn Asn Thr He Ser 

1555 1560 i5 6 5 



ctt aaa act get gcc cct ttc tac aac aac aat gga act tta age 
Leu Lys Thr Ala Ala Pro Phe Tyr Asn Asn Asn Gly Thr Leu Ser 
I 570 1575 1580 



etc aat gtc tec aca cca tta gca gta ttt ccc aca ttt aac act 
Leu Asn Val Ser Thr Pro Leu Ala Val Phe Pro Thr Phe Asn Thr 
1585 1590 1595 

tta ggc ata agt ctt gga aac ggt ctt cag act tea aat aag ttg 
Leu Gly He Ser Leu Gly Asn Gly Leu Gin Thr Ser Asn Lys Leu 
1600 i 6 05 1610 



ttg act gta caa eta act cat cct ctt aca ttc 
Leu Thr Val Gin Leu Thr His Pro Leu Thr Phe 
I 615 1620 1625 



age tea aat age 
Ser Ser Asn Ser 



ate aca gta aaa aca gac aaa ggg eta tat att aac tec agt gga 
He Thr Val Lys Thr Asp Lys Gly Leu Tyr He Asn Ser Ser Gly 
1630 1635 1640 

aac aga gga ctt gag get aat ata age eta aaa aga gga eta gtt 
Asn Arg Gly Leu Glu Ala Asn He Ser Leu Lys Arg Gly Leu Val 
1645 1650 1655 

ttt gac ggt aat get att gca aca tat att gga aat ggc tta gac 
Phe Asp Gly Asn Ala He Ala Thr Tyr He Gly Asn Gly Leu Asp 
1660 1665 1670 

tat gga tct tat gat agt gat gga aaa aca aga ccc gta att acc 
Tyr Gly Ser Tyr Asp Ser Asp Gly Lys Thr Arg Pro Val He Thr 
1675 1680 1685 

aaa att gga gca gga tta aat ttt gat get aac aaa gca ata get 
Lys He Gly Ala Gly Leu Asn Phe Asp Ala Asn Lys Ala He Ala 
1690 1695 1700 



32275 



32320 



32365 



32410 



32455 



32500 



32545 



32590 



32635 



32680 



32725 



32770 



32815 



32860 
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gtc aaa eta ggc aca ggt tta agt ttt gac tec 
Val Lys Leu Gly Thr Gly Leu Ser Phe Asp Ser 
1705 1710 1715 



get ggt gee ttg 
Ala Gly Ala Leu 



32905 



aca get gga aac aaa cag gat gac aag eta aca ctt tgg act acc 
Thr Ala Gly Asn Lys Gin Asp Asp Lys Leu Thr Leu Trp Thr Thr 
1720 1725 1730 



32950 



cct gac cca age cct aat tgt caa tta ctt tea 
Pro Asp Pro Ser Pro Asn Cys Gin Leu Leu Ser 
1735 1740 1745 



gac aga gat gee 
Asp Arg Asp Ala 



aaa ttt act etc tgt ctt aca aaa tgc ggt agt caa ata eta ggc 
Lys Phe Thr Leu Cys Leu Thr Lys Cys Gly Ser Gin lie Leu Gly 
1750 1755 1760 

act gtg gca gtg gcg get gtt act gta gga tea gca eta aat cca 
Thr Val Ala Val Ala Ala Val Thr Val Gly Ser Ala Leu Asn Pro 
1765 1770 1775 

att aat gac aca gtc aaa age gee ata gtt ttc ctt aga ttt gat 
He Asn Asp Thr Val Lys Ser Ala He Val Phe Leu Arg Phe Asp 
1780 1785 1790 



tec gat ggt gta etc atg 
Ser Asp Gly Val Leu Met 
1795 1800 



tea aac tea tea atg gta ggt gat tac 
Ser Asn Ser Ser Met Val Gly Asp Tyr 
1805 



32995 



33040 



33085 



33130 



33175 



tgg aac ttt agg gag gga cag acc act caa agt gta gee tat aca 
Trp Asn Phe Arg Glu Gly Gin Thr Thr Gin Ser Val Ala- Tyr Thr 
1810 1815 1820 



33220 



aat get gtg gga ttc atg cca aat ata ggt gca 
Asn Ala Val Gly Phe Met Pro Asn He Gly Ala 
1825 1830 1835 



tat cca aaa acc 
Tyr Pro Lys Thr 



33265 



caa agt aaa aca cct aaa aat age ata gtc agt cag gta tat tta 
Gin Ser Lys Thr Pro Lys Asn Ser He Val Ser Gin Val Tyr Leu 
1840 1845 1850 



33310 



act gga gaa act act atg cca atg aca eta acc 
Thr Gly Glu Thr Thr Met Pro Met Thr Leu Thr 
1855 I860 1865 



ata act ttc aat 
He Thr Phe Asn 



33355 



ggc act gat gaa aaa gac aca acc cca gtt age acc tac tct atg 
Gly Thr Asp Glu Lys Asp Thr Thr Pro Val Ser Thr Tyr Ser Met 
1870 1875 1880 



33400 



act ttt aca tgg cag tgg act gga gac tat aag gac aaa aat att 

Thr Phe Thr Trp Gin Trp Thr Gly Asp Tyr Lys Asp Lys Asn He 

1885 1890 1895 

acc ttt get acc aac tea ttc tct ttt tec tac ate gee cag gaa 

Thr Phe Ala Thr Asn Ser Phe Ser Phe Ser Tyr He Ala Gin Glu 

1900 1905 1910 



33445 



33490 



taa tcccacccag caagccaacc ccttttccca ccacctttgt ctatatggaa 



33543 
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actctgaaac 


agaaaaataa 


agttcaagtg 


ttttattgaa 


tcaacagttt 


tacaggactc 


33603 


gagcagttat 


ttttcctcca 


ccctcccagg 


acatggaata 


caccaccctc 


tccccccgca 


33663 


cagccttgaa 


catctgaatg 


ccattggtga 


tggacatget 


tttggtctcc 


acgttccaca 


33723 


cagtttcaga 


gcgagccagt 


cteggategg 


tcagggagat 


gaaaccctcc 


gggcactccc 


33783 


gcatctgcac 


ctcacagctc 


aacagctgag 


gattgtcctc 


ggtggtcggg 


ateaeggtta 


33843 


tctggaagaa 


gcagaagagc 


ggcggtggga 


atcatagtcc 


gcgaacggga 


teggceggtg 


33903 


gtgtcgcatc 


aggccccgca 


geagtegctg 


ccgccgccgc 


tccgtcaagc 


tgctgctcag 


33963 


ggggttcggg 


tccagggact 


ccctcagcat 


gatgcccacg 


gccctcagca 


teagtegtet 


34023 


ggtgcggcgg 


gcgcagcagc 


geatgegaat 


ctcgctcagg 


teactgeagt 


aegtgeaaca 


34083 


caggaccacc 


aggttgttca 


acagtccata 


gttcaacacg 


ctccagccga 


aactcatcgc 


34143 


gggaaggatg 


ctacccacgt 


ggccgtcgta 


ccagatcctc 


aggtaaatca 


agtggcgctc 


34203 


cctccagaag 


acgctgccca 


tgtacatgat 


ctccttgggc 


atgtggcggt 


tcaccacctc 


34263 


ccggtaccac 


atcaccctct 


ggttgaacat 


gcagccccgg 


atgatcctgc 


ggaaccacag 


34323 


ggccagcacc 


gccccgcccg 


ccatgcagcg 


aagagacccc 


ggatcccggc 


aatgacaatg 


34383 


gaggacccac 


cgctcgtacc 


cgtggatcat 


ctgggagctg 


aacaagtcta 


tgttggcaca 


34443 


gcacaggcat 


atgctcatgc 


atctcttcag 


cactctcagc 


tecteggggg 


tcaaaaccat 


34503 


atcccagggc 


acggggaact 


ettgeaggae 


agcgaacccc 


gcagaacagg 


gcaatcctcg 


34563 


cacataactt 


acattgtgca 


tggacagggt 


ategcaatea 


ggcagcaccg 


ggtgatcctc 


34623 


caccagagaa 


gcgcgggtct 


cggtctcctc 


acagcgtggt 


aagggggccg 


gecgataegg 


34683 


gtgatggcgg 


gacgcggctg 


atcgtgttct 


cgaccgtgtc 


atgatgcagt 


tgctttcgga 


34743 


cattttcgta 


cttgctgtag 


cagaacctgg 


tccgggcgct 


gcacaccgat 


cgccggcggc 


34803 


ggtctcggcg 


cttggaacgc 


tcggtgttaa 


agttgtaaaa 


cagccactct 


ctcagaccgt 


34863 


gcagcagatc 


tagggcctca 


ggagtgatga 


agatcccatc 


atgectgata 


gctctgatca 


34923 


catcgaccac 


cgtggaatgg 


gccaggccca 


gecagatgat 


gcaattttgt 


tgggtttcgg 


34983 


tcra cicscjcncicr 
*-*m ti ^**y y y y y 


crcra tier era a era 

yya.yyycxci.yci. 




cca t~ rf a f t"33 

* — <w- Cl LUCl I— L>dCt 




add^y y lulu 


35043 


ggagcacttc 


aaaatgaagg 


teaeggagat 


ggcacctctc 


gcccccgctg 


tgttggtgga 


35103 


aaataacagc 


caggtcaaag 


gtgatacggt 


tctcgagatg 


ttccacggtg 


gcttccagca 


35163 


aagcctccac 


gcgcacatcc 


agaaacaaga 


caatagegaa 


agegggaggg 


ttctctaatt 


35223 
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cctcaaccat 


catgttacac 


tcctgcacca 


tccccagata 


attttcattt ttccagcctt 


35283 


gaatgattcg 


aactagttcc 


tgaggtaaat 


ccaagccagc 


catgataaaa agctcgcgca 


35343 


gagcaccctc 


caccggcatt 


cttaagcaca 


ccctcataat 


tccaagatat tctgctcctg 


35403 


gttcacctgc 


agcagattga 


caagcggaat 


atcaaaatct 


ctgccgcgat ccctgagctc 


35463 


ctccctcagc 


aataactgta 


agtactcttt 


catatcgtct 


ccgaaatttt tagccatagg 


35523 


acccccagga 


ataagagaag 


ggcaagccac 


attacagata 


aaccgaagtc ccccccagtg 


35583 


agcattgcca 


aatgtaagat 


tgaaataagc 


atgctggcta 


gacccggtga tatcttccag 


35643 


ataactggac 


agaaaatcgg 


gtaagcaatt 


tttaagaaaa 


tcaacaaaag aaaaatcttc 


35703 


caggtgcacg 


tttagggcct 


cgggaacaac 


gatggagtaa 


gtgcaagggg tgcgttccag 


35763 


catggttagt 


tagctgatct 


gtaaaaaaac 


aaaaaataaa 


acattaaacc atgctagcct 


35823 


ggcgaacagg 


tgggtaaatc 


gttctctcca 


gcaccaggca 


ggccacgggg tctccggcgc 


35883 


gaccctcgta 


aaaattgtcg 


ctatgattga 


aaaccatcac 


agagagacgt tcccggtggc 


35943 


cggcgtgaat 


gattcgagaa 


gaagcataca 


cccccggaac 


attggagtcc gtgagtgaaa 


36003 


aaaagcggcc 


gaggaagcaa 


tgaggcacta 


caacgctcac 


tctcaagtcc agcaaagcga 


36063 


tgccatgcgg 


atgaagcaca 


aaattttcag 


gtgcgtaaaa 


aatgtaatta ctcccctcct 


36123 


gcacaggcag 


cgaagctccc 


gatccctcca 


gatacacata 


caaagcctca gcgtccatag 


36183 


cttaccgagc 


ggcagcagca 


gcggcacaca 


acaggcgcaa 


gagtcagaga aaagactgag 


36243 


ctctaacctg 


tccgcccgct 


ctctgctcaa 


tatatagccc 


cagatctaca ctgacgtaaa 


36303 


ggccaaagtc 


taaaaatacc 


cgccaaataa 


tcacacacgc 


ccagcacacg cccagaaacc 


36363 


ggtgacacac 


tcagaaaaat 


acgcgcactt 


cctcaaacgg 


ccaaactgcc gtcatttccg 


36423 


ggttcccacg 


ctacgtcatc 


aaaacacgac 


tttcaaattc 


cgtcgaccgt taaaaacatc 


36483 


acccgccccg 


cccctaacgg 


tcgccgctcc 


cgcagccaat 


caccttcctc cctccccaaa 


36543 


ttcaaacagc 


tcatttgcat 


attaacgcgc 


accaaaagtt 


tgaggtatat tattgatgat 


36603 


g 










36604 



<210> 6 

<211> 529 

<212> PRT 

<213> chimpanzee adenovirus serotype Pan6 
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<400> 6 

Met Met Arg Arg Val Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser 
15 10 15 

Val Met Gin Gin Ala Val Ala Ala Ala Met Gin Pro Pro Leu Glu Ala 
20 25 30 

Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu Gly Arg Asn Ser 
35 40 45 

lie Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Arg Leu Tyr 
50 55 60 

Leu Val Asp Asn Lys Ser Ala Asp lie Ala Ser Leu Asn Tyr Gin Asn 
65 70 75 80 

Asp His Ser Asn Phe Leu Thr Thr Val Val Gin Asn Asn Asp Phe Thr 
85 90 95 

Pro Thr Glu Ala Ser Thr Gin Thr He Asn Phe Asp Glu Arg Ser Arg 
100 105 110 

Trp Gly Gly Gin Leu Lys Thr He Met His Thr Asn Met Pro Asn Val 
115 120 125 

Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg Val Met Val Ser 
130 135 140 

Arg Lys Thr Pro Asn Gly Val Asp Asp Asp Tyr Asp Gly Ser Gin Asp 
145 150 155 ~ 160 

Glu Leu Thr Tyr Glu Trp Val Glu Phe Glu Leu Pro Glu Gly Asn Phe 
165 170 175 

Ser Val Thr Met Thr lie Asp Leu Met Asn Asn Ala He He Asp Asn 
180 185 190 

Tyr Leu Ala Val Gly Arg Gin Asn Gly Val Leu Glu Ser Asp He Gly 
195 200 205 

Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro Val Thr 
210 215 220 

Glu Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His Pro Asp 
225 230 235 240 

He Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Glu Ser Arg Leu 
245 250 255 

Ser Asn Leu Leu Gly He Arg Lys Arg Gin Pro Phe Gin Glu Gly Phe 
260 265 270 

Gin He Leu Tyr Glu Asp Leu Glu Gly Gly Asn He Pro Ala Leu Leu 
275 280 285 



63 



WO 03/046124 



PCT/US02/33645 



Asp Val Glu Ala Tyr Glu Lys Ser Lys Glu Asp Ser Thr Ala Ala Ala 
290 295 300 

Thr Ala Ala Val Ala Thr Ala Ser Thr Glu Val Arg Gly Asp Asn Phe 
305 310 315 "* " 320 

Ala Ser Ala Ala Ala Ala Ala Glu Ala Ala Glu Thr Glu Ser Lys lie 
325 330 335 

Val lie Gin Pro Val Glu Lys Asp Ser Lys Asp Arg Ser Tyr Asn Val 
340 345 "* 350 

Leu Ala Asp Lys Lys Asn Thr Ala Tyr Arg Ser Trp Tyr Leu Ala Tyr 
355 360 " 365 

Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp Thr Leu Leu Thr 
370 375 380 

Thr Ser Asp Val Thr Cys Gly Val Glu Gin Val Tyr Trp Ser Leu Pro 
385 390 395 ~ * 400 

Asp Met Met Gin Asp Pro Val Thr Phe Arg Ser Thr Arg Gin Val Ser 
405 410 415 

Asn Tyr Pro Val Val Gly Ala Glu Leu Leu Pro Val Tyr Ser Lys Ser 
420 425 ~ 430 

Phe Phe Asn Glu Gin Ala Val Tyr Ser Gin Gin Leu Arg Ala Phe Thr 
435 440 445 

Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn Gin lie Leu Val 
450 455 460 

Arg Pro Pro Ala Pro Thr lie Thr Thr Val Ser Glu Asn Val Pro Ala 
465 470 475 480 

Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser lie Arg Gly Val 
485 490 495 

Gin Arg Val Thr Val Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val 
500 505 ^ ~ 510 

Tyr Lys Ala Leu Gly Val Val Ala Pro Arg Val Leu Ser Ser Arg Thr 
515 520 525 



Phe 



<210> 7 
<211> 942 
<212> PRT 

<213> chimpanzee adenovirus serotype Pan6 
<400> 7 

Met Ala Thr Pro Ser Met Leu Pro Gin Trp Ala Tyr Met His lie Ala 
15 10 15 
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Gly Gin Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gin Phe Ala 
20 ~ 25 30 

Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro 
35 40 45 

Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gin Arg Leu 
50 55 " 60 

Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr 
65 70 75 80 

Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met 
85 90 95 

Ala Ser Thr Tyr Phe Asp lie Arg Gly Val Leu Asp Arg Gly Pro Ser 
100 105 ** 110 

Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly 
115 120 125 

Ala Pro Asn Ser Ser Gin Trp Glu Gin Ala Lys Thr Gly Asn Gly Gly 
130 135 140 

Thr Met Glu Thr His Thr Tyr Gly Val Ala Pro Met Gly Gly Glu Asn 
145 150 155 160 

lie Thr Lys Asp Gly Leu Gin lie Gly Thr Asp Val Thr Ala Asn Gin 
165 170 175 

Asn Lys Pro lie Tyr Ala Asp Lys Thr Phe Gin Pro Glu Pro Gin Val 
180 ' " 185 190 

Gly Glu Glu Asn Trp Gin Glu Thr Glu Asn Phe Tyr Gly Gly Arg Ala 
195 200 " 205 

Leu Lys Lys Asp Thr Asn Met Lys Pro Cys Tyr Gly Ser Tyr Ala Arg 
210 215 " 220 

Pro Thr Asn Glu Lys Gly Gly Gin Ala Lys Leu Lys Val Gly Asp Asp 
225 230 235 240 

Gly Val Pro Thr Lys Glu Phe Asp lie Asp Leu Ala Phe Phe Asp Thr 
245 250 255 

Pro Gly Gly Thr Val Asn Gly Gin Asp Glu Tyr Lys Ala Asp lie Val 
260 265 270 

Met Tyr Thr Glu Asn Thr Tyr Leu Glu Thr Pro Asp Thr His Val Val 
275 280 ^ 285 

Tyr Lys Pro Gly Lys Asp Asp Ala Ser Ser Glu lie Asn Leu Val Gin 
290 295 300 

Gin Ser Met Pro Asn Arg Pro Asn Tyr lie Gly Phe Arg Asp Asn Phe 
305 310 315 320 
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lie Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala 
325 330 335 

Gly Gin Ala Ser Gin Leu Asn Ala Val Val Asp Leu Gin Asp Arg Asn 
340 345 ~ 350 

Thr Glu Leu Ser Tyr Gin Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr 
355 360 365 

Arg Tyr Phe Ser Met Trp Asn Gin Ala Val Asp Ser Tyr Asp Pro Asp 
370 375 3S0 

Val Arg lie lie Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr 
385 390 395 400 

Cys Phe Pro Leu Asp Gly Ser Gly Thr Asn Ala Ala Tyr Gin Gly Val 
405 410 415 

Lys Val Lys Asp Gly Gin Asp Gly Asp Val Glu Ser Glu Trp Glu Asn 
420 ' 425 430 

Asp Asp Thr Val Ala Ala Arg Asn Gin Leu Cys Lys Gly Asn lie Phe 
435 440 ~ 445 

Ala Met Glu He Asn Leu Gin Ala Asn Leu Trp Arg Ser Phe Leu Tyr 
450 455 460 

Ser Asn Val Ala Leu Tyr Leu Pro Asp Ser Tyr Lys Tyr Thr Pro Thr 
465 470 475 480 

Asn Val Thr Leu Pro Thr Asn Thr Asn Thr Tyr Asp Tyr Met Asn Gly 
485 490 ~ 495 

Arg Val Thr Pro Pro Ser Leu Val Asp Ala Tyr Leu Asn He Gly Ala 
500 505 510 

Arg Trp Ser Leu Asp Pro Met Asp Asn Val Asn Pro Phe Asn His His 
515 520 525 

Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu Leu Gly Asn Gly Arg 
530 535 540 

Tyr Val Pro Phe His He Gin Val Pro Gin Lys Phe Phe Ala He Lys 
545 550 555 560 

Ser Leu Leu Leu Leu Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg 
565 570 575 

Lys Asp Val Asn Met He Leu Gin Ser Ser Leu Gly Asn Asp Leu Arg 
580 585 590 

Thr Asp Gly Ala Ser He Ala Phe Thr Ser He Asn Leu Tyr Ala Thr 
595 600 605 



Phe Phe Pro Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu 
610 615 620 
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Arg Asn Asp Thr Asn Asp Gin Ser Phe Asn Asp Tyr Leu Ser Ala Ala 



630 



635 



640 



Asn Met Leu Tyr Pro lie Pro Ala Asn Ala Thr Asn Val Pro lie Ser 
645 650 655 

lie Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg 
660 3 



665 



670 



Leu Lys Thr Arg Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr 



680 



685 



Phe Val Tyr Ser Gly Ser He Pro Tyr Leu Asp Gly Thr Phe Tyr Leu 
690 695 7 00 

Asn His Thr Phe Lys Lys Val Ser lie Thr Phe Asp Ser Ser Val Ser 



710 



715 



720 



Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu He Lys 
7 25 730 



735 



Arg Thr Val Asp Gly Glu Gly Tyr Asn Val Ala Gin Cys Asn Met Thr 

740 745 75 0 

Lys Asp Trp Phe Leu Val Gin Met Leu Ala His Tyr Asn lie Gly Tyr 

755 7 6 o ?65 

Gin Gly Phe Tyr Val Pro Glu Gly Tyr Lys Asp Arg Met Tyr Ser Phe 

110 775 7 8 o 

Phe Arg Asn Phe Gin Pro Met Ser Arg Gin Val Val Asp Glu Val Asn 



790 



795 800 
Tyr Lys Asp Tyr Gin Ala Val Thr Leu Ala Tyr Gin His Asn Asn Ser 



805 



810 



815 



Gly Phe Val Gly Tyr Leu Ala Pro Thr Met Arg Gin Gly Gin Pro Tyr 
820 825 



830 



Pro Ala Asn Tyr Pro Tyr Pro Leu lie Gly Lys Ser Ala Val Ala Ser 
835 840 845 

Val Thr Gin Lys Lys Phe Leu Cys Asp Arg Val Met Trp Arg He Pro 



855 



860 



Phe Ser Ser Asn Phe Met Ser Met Gly Ala Leu Thr Asp Leu Gly Gin 

870 875 880 

Asn Met Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met Asn Phe Glu 
885 890 895 

Val Asp Pro Met Asp Glu Ser Thr Leu Leu Tyr Val Val Phe Glu Val 
900 905 910 

Phe Asp Val Val Arg Val His Gin Pro His Arg Gly Val lie Glu Ala 
915 920 925 
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Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr 
930 935 940 



<210> 8 
<211> 443 
<212> PRT 

<213> chimpanzee adenovirus serotype Pan6 
<400> 8 

Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp Phe Asp Pro Val Tyr 
15 10 15 

Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val Pro Phe He Asn Pro Pro 
20 25 30 

Phe Val Ser Ser Asp Gly Phe Gin Glu Lys Pro Leu Gly Val Leu Ser 
35 40 45 

Leu Arg Leu Ala Asp Pro Val Thr Thr Lys Asn Gly Glu He Thr Leu 
50 55 60 

Lys Leu Gly Glu Gly Val Asp Leu Asp Ser Ser Gly Lys Leu He Ser 
65 70 75 ~ 80 

Asn Thr Ala Thr Lys Ala Ala Ala Pro Leu Ser He Ser Asn Asn Thr 
85 90 95 

He Ser Leu Lys Thr Ala Ala Pro Phe Tyr Asn Asn Asn Gly Thr Leu 
100 105 no 

Ser Leu Asn Val Ser Thr Pro Leu Ala Val Phe Pro Thr Phe Asn Thr 
115 120 125 

Leu Gly He Ser Leu Gly Asn Gly Leu Gin Thr Ser Asn Lys Leu Leu 
130 135 140 

Thr Val Gin Leu Thr His Pro Leu Thr Phe Ser Ser Asn Ser He Thr 
145 150 155 160 

Val Lys Thr Asp Lys Gly Leu Tyr He Asn Ser Ser Gly Asn Arg Gly 
165 170 175 

Leu Glu Ala Asn He Ser Leu Lys Arg Gly Leu Val Phe Asp Gly Asn 
180 185 190 

Ala He Ala Thr Tyr He Gly Asn Gly Leu Asp Tyr Gly Ser Tyr Asp 
195 200 205 

Ser Asp Gly Lys Thr Arg Pro Val He Thr Lys He Gly Ala Gly Leu 
210 215 220 

Asn Phe Asp Ala Asn Lys Ala He Ala Val Lys Leu Gly Thr Gly Leu 
225 230 235 240 
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Ser Phe Asp Ser Ala Gly Ala Leu Thr Ala Gly Asn Lys Gin Asp Asp 
245 250 " 255 

Lys Leu Thr Leu Trp Thr Thr Pro Asp Pro Ser Pro Asn Cys Gin Leu 
260 265 270 

Leu Ser Asp Arg Asp Ala Lys Phe Thr Leu Cys Leu Thr Lys Cys Gly 
275 ~ ^ ' 280 ~ 285 

Ser Gin He Leu Gly Thr Val Ala Val Ala Ala Val Thr Val Gly Ser 
290 295 300 

Ala Leu Asn Pro He Asn Asp Thr Val Lys Ser Ala He Val Phe Leu 
305 310 " 315 320 

Arg Phe Asp Ser Asp Gly Val Leu Met Ser Asn Ser Ser Met Val Gly 
325 ~ 330 335 

Asp Tyr Trp Asn Phe Arg Glu Gly Gin Thr Thr Gin Ser Val Ala Tyr 
340 ~ 345 350 

Thr Asn Ala Val Gly Phe Met Pro Asn He Gly Ala Tyr Pro Lys Thr 
355 360 365 

Gin Ser Lys Thr Pro Lys Asn Ser He Val Ser Gin Val Tyr Leu Thr 
370 375 380 

Gly Glu Thr Thr Met Pro Met Thr Leu Thr He Thr Phe Asn Gly Thr 
385 390 395 400 

Asp Glu Lys Asp Thr Thr Pro Val Ser Thr Tyr Ser Met Thr Phe Thr 
405 410 " 415 

Trp Gin Trp Thr Gly Asp Tyr Lys Asp Lys Asn He Thr Phe Ala Thr 
420 ~ 425 "* * 430 



Asn Ser Phe Ser Phe Ser Tyr He Ala Gin Glu 
435 ~ 440 



<210> 9 

<211> 36535 

<212> DNA 

<213> chimpanzee adenovirus serotype Pan7 
<220> 

<221> CDS 

<222> (13874) . . (15469) 

<223> L2 Penton 

<220> 

<221> CDS 

<222> (18288) . . (21086) 

<223> L3 Hexon 
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<220> 

<221> CDS 

<222> (32094) . . (33425) 
<223> L5 Fiber 

<400> 9 

catcatcaat aatatacctc aaacttttgg tgcgcgttaa tatgcaaatg agctgtttga 60 

atttggggag ggaggaaggt gattggccga gagacgggcg accgttaggg gcggggcggg 120 

tgacgttttt aatacgtggc cgtgaggcgg agccggtttg caagttctcg tgggaaaagt 180 

gacgtcaaac gaggtgtggt ttgaacacgg aaatactcaa ttttcccgcg ctctctgaca 240 

ggaaatgagg tgtttctggg cggatgcaag tgaaaacggg ccattttcgc gcgaaaactg 300 

aatgaggaag tgaaaatctg agtaatttcg cgtttatggc agggaggagt atttgccgag 360 

ggccgagtag actttgaccg attacgtggg ggtttcgatt accgtatttt tcacctaaat 420 

ttccgcgtac ggtgtcaaag tccggtgttt ttacgtaggc gtcagctgat cgccagggta 480 

tttaaacctg cgctctctag tcaagaggcc actcttgagt gccagcgagt agagttttct 540 

cctccgcgcc gcgagtcaga tctacacttt gaaagatgag gcacctgaga gacctgcccg 600 

gtaatgtttt cctggctact gggaacgaga ttctggaatt ggtggtggac gccatgatgg 660 

gtggcgaccc tcctgagccc cctaccccat ttgaggcgcc ttcgctgtac gatttgtatg 720 

atctggaggt ggatgtgccc gagaacgacc ccaacgagga ggcggtgaat gatttgttta 780 

gcgatgccgc gctgctggct gccgagcagg ctaatacgga ctctggctca gacagcgatt 840 

cctctctcca taccccgaga cccggcagag gtgagaaaaa gatccccgag cttaaagggg 900 

aagagctcga cctgcgctgc tatgaggaat gcttgcctcc gagcgatgat gaggaggacg 960 

aggaggcgat tcgagctgca tcgaaccagg gagtgaaagc tgcgggcgaa agctttagcc 1020 

tggactgtcc tactctgccc ggacacggct gtaagtcttg tgaatttcat cgcatgaata 1080 

ctggagataa gaatgtgatg tgtgccctgt gctatatgag agcttacaac cattgtgttt 1140 

acagtaagtg tgattaactt tagttgggaa ggcagagggt gactgggtgc tgactggttt 1200 

atttatgtat atgttttttt atgtgtaggt cccgtctctg acgtagatga gacccccact 1260 

tcagagtgca tttcatcacc cccagaaatt ggcgaggaac cgcccgaaga tattattcat 1320 

agaccagttg cagtgagagt caccgggcgg agagcagctg tggagagttt ggatgacttg 1380 

ctacagggtg gggatgaacc tttggacttg tgtacccgga aacgccccag gcactaagtg 1440 

ccacacatgt gtgtttactt aaggtgatgt cagtatttat agggtgtgga gtgcaataaa 1500 
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atccgtgttg actttaagtg cgtggtttat gactcagggg tggggactgt gggtatataa 1560 

gcaggtgcag acctgtgtgg tcagttcaga gcaggactca tggagatctg gacggtcttg 1620 

gaagactttc accagactag acagctgcta gagaactcat cggagggggt ctcttacctg 1680 

tggagattct gcttcggtgg gcctctagct aagctagtct atagggccaa acaggattat 1740 

aaggatcaat ttgaggatat tttgagagag tgtcctggta tttttgactc tctcaacttg 1800 

ggccatcagt ctcactttaa ccagagtatt ctgagagccc ttgacttttc tactcctggc 1860 

agaactaccg ccgcggtagc cttttttgcc tttatccttg acaaatggag tcaagaaacc 192 0 

catttcagca gggattaccg tctggactgc ttagcagtag ctttgtggag aacatggagg 1980 

tgccagcgcc tgaatgcaat ctccggctac ttgccagtac agccggtaga cacgctgagg 2040 

atcctgagtc tccagtcacc ccaggaacac caacgccgcc agcagccgca gcaggagcag 2100 

cagcaagagg aggaggagga tcgagaagag aacccgagag ccggtctgga ccctccggtg 2160 

gcggaggagg aggagtagct gacttgtttc ccgagctgcg ccgggtgctg actaggtctt 2220 

ccagtggacg ggagaggggg attaagcggg agaggcatga ggagactagc cacagaactg 2280 

aactgactgt cagtctgatg agccgcaggc gcccagaatc ggtgtggtgg catgaggttc 2340 

agtcgcaggg gatagatgag gtctcggtga tgcatgagaa atattccctg gaacaagtca 2400 

agacttgttg gttggagcct gaggatgatt gggaggtagc catcaggaat tatgccaagc 2460 

tggctctgaa gccagacaag aagtacaaga ttaccaaact gattaatatc agaaattcct 2520 

gctacatttc agggaatggg gccgaggtgg agatcagtac ccaggagagg gtggccttca 2580 

gatgttgtat gatgaatatg tacccggggg tggtgggcat ggagggagtc acctttatga 2640 

acgcgaggtt caggggtgat gggtataatg gggtggtctt tatggccaac accaagctga 2700 

cagtgcacgg atgctccttc tttgggttca ataacatgtg catcgaggcc tggggcagtg 2760 

tttcagtgag gggatgcagc ttttcagcca actggatggg ggtcgtgggc agaaccaaga 2820 

gcaaggtgtc agtgaagaaa tgcctgttcg agaggtgcca cctgggggtg atgagcgagg 2880 

gcgaagccaa agtcaaacac tgcgcctcta ctgagacggg ctgctttgtg ctgatcaagg 2940 

gcaatgccca agtcaagcat aacatgatct gtggggcctc ggatgagcgc ggctaccaga 3000 

tgctgacctg cgccggtggg aacagccata tgctggccac cgtgcatgtg acctcgcacc 3060 

cccgcaagac atggcccgag ttcgagcaca acgtcatgac ccgatgcaat gtgcacctgg 3120 

ggtcccgccg aggcatgttc atgccctacc agtgcaacat gcaatttgtg aaggtgctgc 3180 

tggagcccga tgccatgtcc agagtgagcc tgacgggggt gtttgacatg aatgtggagc 3240 
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tgtggaaaat tctgagatat gatgaatcca agaccaggtg ccgggcctgc gaatgcggag 3300 
gcaagcacgc caggcttcag cccgtgtgtg tggaggtgac ggaggacctg cgacccgatc 3360 
atttggtgtt gtcctgcaac gggacggagt tcggctccag cggggaagaa tctgactaga 3420 
gtgagtagtg tttgggggag gtggagggct tgtatgaggg gcagaatgac taaaatctgt 3480 
gtttttctgt gtgttgcagc agcatgagcg gaagcgcctc ctttgaggga ggggtattca 3540 
gcccttatct gacggggcgt ctcccctcct gggcgggagt gcgtcagaat gtgatgggat 3600 
ccacggtgga cggccggccc gtgcagcccg cgaactcttc aaccctgacc tacgcgaccc 3660 
tgagctcctc gtccgtggac gcagctgccg ccgcagctgc tgcttccgcc gccagcgccg 3720 
tgcgcggaat ggccctgggc gccggctact acagctctct ggtggccaac tcgacttcca 3780 

ccaataatcc cgccagcctg aacgaggaga agctgctgct gctgatggcc cagctcgagg 3840 

ccctgaccca gcgcctgggc gagctgaccc agcaggtggc tcagctgcag gcggagacgc 3900 

gggccgcggt tgccacggtg aaaaccaaat aaaaaatgaa tcaataaata aacggagacg 3960 

gttgttgatt ttaacacaga gtcttgaatc tttatttgat ttttcgcgcg cggtaggccc 4020 

tggaccaccg gtctcgatca ttgagcaccc ggtggatttt ttccaggacc cggtagaggt 4080 

gggcttggat gttgaggtac atgggcatga gcccgtcccg ggggtggagg tagctccatt 4140 

gcagggcctc gtgctcgggg gtggtgttgt aaatcaccca gtcatagcag gggcgcaggg 4200 

cgtggtgctg cacgatgtcc ttgaggagga gactgatggc cacgggcagc cccttggtgt 4260 

aggtgttgac gaacctgttg agctgggagg gatgcatgcg gggggagatg agatgcatct 4320 

tggcctggat cttgagattg gcgatgttcc cgcccagatc ccgccggggg ttcatgttgt 4380 

gcaggaccac cagcacggtg tatccggtgc acttggggaa tttgtcatgc aacttggaag 4440 

ggaaggcgtg aaagaatttg gagacgccct tgtgaccgcc caggttttcc atgcactcat 4500 

ccatgatgat ggcgatgggc ccgtgggcgg cggcctgggc aaagacgttt cgggggtcgg 4560 

acacatcgta gttgtggtcc tgggtgagct cgtcataggc cattttaatg aatttggggc 4620 

ggagggtgcc cgactggggg acgaaggtgc cctcgatccc gggggcgtag ttgccctcgc 4680 

agatctgcat ctcccaggcc ttgagctcgg agggggggat catgtccacc tgcggggcga 4740 

tgaaaaaaac ggtttccggg gcgggggaga tgagctgggc cgaaagcagg ttccggagca 4800 

gctgggactt gccgcagccg gtggggccgt agatgacccc gatgaccggc tgcaggtggt 4860 

agttgaggga gagacagctg ccgtcctcgc ggaggagggg ggccacctcg ttcatcatct 4920 
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cgcgcacatg catgttctcg cgcacgagtt ccgccaggag gcgctcgccc cccagcgaga 4980 

ggagctcttg cagcgaggcg aagtttttca gcggcttgag yccgtcggcc atgggcattt 5040 

tggagagggt ctgttgcaag agttccagac ggtcccagag ctcggtgatg tgctctaggg 5100 

catctcgatc cagcagacct cctcgtttcg cgggttgggg cgactgcggg agtagggcac 5160 

caggcgatgg gcgtccagcg aggccagggt ccggtccttc cagggtcgca gggtccgcgt 5220 

cagcgtggtc tccgtcacgg tgaaggggtg cgcgccgggc tgggcgcttg cgagggtgcg 5280 

cttcaggctc atccggctgg tcgagaaccg ctcccggtcg gcgccctgcg cgtcggccag 5340 

gtagcaattg agcatgagtt cgtagttgag cgcctcggcc gcgtggccct tggcgcggag 5400 

cttacctttg gaagtgtgtc cgcagacggg acagaggagg gacttgaggg cgtagagctt 5460 

gggggcgagg aagacggact cgggggcgta ggcgtccgcg ccgcagctgg cgcagacggt 5520 

ctcgcactcc acgagccagg tgaggtcggg ccggttgggg tcaaaaacga ggtttcctcc 5580 

gtgctttttg atgcgtttct tacctctggt ctccatgagc tcgtgtcccc gctgggtgac 5640 

aaagaggctg tccgtgtccc cgtagaccga ctttatgggc cggtcctcga gcggggtgcc 5700 

gcggtcctcg tcgtagagga accccgccca ctccgagacg aaggcccggg tccaggccag 5760 

cacgaaggag gccacgtggg aggggtagcg gtcgttgtcc accagcgggt ccaccttctc 5820 

cagggtatgc aagcacatgt ccccctcgtc cacatccagg aaggtgattg gcttgtaagt 5880 

gtaggccacg tgaccggggg tcccggccgg gggggtataa aagggggcgg gcccctgctc 5940 

gtcctcactg tcttccggat cgctgtccag gagcgccagc tgttggggta ggtattccct 6000 

ctcgaaggct ggcataacct cggcactcag gttgtcagtt tctagaaacg aggaggattt 6060 

gatattgacg gtgccgttgg agacgccttt catgagcccc tcgtccatct ggtcagaaaa 6120 

gacgatcttt ttgttgtcga gcttggtggc gaaggagccg tagagggcgt tggagaggag 6180 

cttggcgatg gagcgcatgg tctggttctt ttccttgtcg gcgcgctcct tggcggcgat 6240 

gttgagctgc acgtactcgc gcgccacgca cttccattcg gggaagacgg tggtgagctc 6300 

gtcgggcacg attctgaccc gccagccgcg gttgtgcagg gtgatgaggt ccacgctggt 6360 

ggccacctcg ccgcgcaggg gctcgttggt ccagcagagg cgcccgccct tgcgcgagca 6420 

gaaggggggc agcgggtcca gcatgagctc gtcggggggg tcggcgtcca cggtgaagat 6480 

gccgggcaga agctcggggt cgaagtagct gatgcaggtg tccagatcgt ccagcgccgc 6540 

ttgccagtcg cgcacggcca gcgcgcgctc gtaggggctg aggggcgtgc cccagggcat 6600 

ggggtgcgtg agcgcggagg cgtacatgcc gcagatgtcg tagacgtaga ggggctcctc 6660 
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gaggacgccg atgtaggtgg ggtagcagcg ccccccgcgg atgctggcgc gcacgtagtc 6720 

gtacagctcg tgcgagggcg cgaggagccc cgtgccgagg ttggagcgtt gcggcttttc 6780 

ggcgcggtag acgatctggc ggaagatggc gtgggagttg gaggagatgg tgggcctctg 6840 

gaagatgttg aagtgggcgt ggggcaggcc gaccgagtcc ctgatgaagt gggcgtagga 6900 

gtcctgcagc ttggcgacga gctcggcggt gacgaggacg tccagggcgc agtagtcgag 6960 

ggtctcttgg atgatgtcgt acttgagctg gcccttctgc ttccacagct cgcggttgag 7020 

aaggaactct tcgcggtcct tccagtactc ttcgaggggg aacccgtcct gatcggcacg 7080 

gtaagagccc accatgtaga actggttgac ggccttgtag gcgcagcagc ccttctccac 7140 

ggggagggcg taagcttgtg cggccttgcg cagggaggtg tgggtgaggg cgaaggtgtc 7200 

gcgcaccatg accttgagga actggtgctt gaagtcgagg tcgtcgcagc cgccctgctc 7260 

ccagagctgg aagtccgtgc gcttcttgta ggcggggttg ggcaaagcga aagtaacatc 7320 

gttgaagagg atcttgcccg cgcggggcat gaagttgcga gtgatgcgga aaggctgggg 7380 

cacctcggcc cggttgttga tgacctgggc ggcgaggacg atctcgtcga agccgttgat 7440 

gttgtgcccg acgatgtaga gttccacgaa tcgcgggcgg cccttaacgt ggggcagctt 7500 

cttgagctcg tcgtaggtga gctcggcggg gtcgctgagc ccgtgctgct cgagggccca 7560 

gtcggcgacg tgggggttgg cgctgaggaa ggaagtccag agatccacgg ccagggcggt 7620 

ctgcaagcgg tcccggtact gacggaactg ctggcccacg gccatttttt cgggggtgac 7680 

gcagtagaag gtgcgggggt cgccgtgcca gcggtcccac ttgagctgga gggcgaggtc 7740 

gtgggcgagc tcgacgagcg gcgggtcccc ggagagtttc atgaccagca tgaaggggac 7800 

gagctgcttg ccgaaggacc ccatccaggt gtaggtttcc acatcgtagg tgaggaagag 7860 

cctttcggtg cgaggatgcg agccgatggg gaagaactgg atctcctgcc accagttgga 7920 

ggaatggctg ttgatgtgat ggaagtagaa atgccgacgg cgcgccgagc actcgtgctt 7980 

gtgtttatac aagcgtccgc agtgctcgca acgctgcacg ggatgcacgt gctgcacgag 8040 

ctgtacctgg gttcctttga cgaggaattt cagtgggcag tggagcgctg gcggctgcat 8100 

ctggtgctgt actacgtcct ggccatcggc gtggccatcg tctgcctcga tggtggtcat 8160 

gctgacgagc ccgcgcggga ggcaggtcca gacttcggct cggacgggtc ggagagcgag 8220 

gacgagggcg cgcaggccgg agctgtccag ggtcctgaga cgctgcggag tcaggtcagt 8280 

gggcagcggc ggcgcgcggt tgacttgcag gagcttttcc agggcgcgcg ggaggtccag 8340 



74 



WO 03/046124 



PCT7US02/33645 



atggtacttg atctccacgg cgccgttggt ggcgacgtcc acggcttgca gggtcccgtg 8400 

cccctggggc gccaccaccg tgccccgttt cttcttgggc gctgcttcca tgccggtcag 8460 

aagcggcggc gaggacgcgc gccgggcggc aggggcggct cgggacccgg aggcaggggc 8520 

ggcaggggca cgtcggcgcc gcgcgcgggc aggttctggt actgcgcccg gagaagactg 8580 

gcgtgagcga cgacgcgacg gttgacgtcc tggatctgac gcctctgggt gaaggccacg 8640 

ggacccgtga gtttgaacct gaaagagagt tcgacagaat caatctcggt atcgttgacg 8700 

gcggcctgcc gcaggatctc ttgcacgtcg cccgagttgt cctggtaggc gatctcggtc 8760 

atgaactgct cgatctcctc ctcctgaagg tctccgcggc cggcgcgctc gacggtggcc 8820 

gcgaggtcgt tggagatgcg gcccatgagc tgcgagaagg cgttcatgcc ggcctcgttc 88 80 

cagacgcggc tgtagaccac ggctccgtcg gggtcgcgcg cgcgcatgac cacctgggcg 894 0 

aggttgagct cgacgtggcg cgtgaagacc gcgtagttgc agaggcgctg gtagaggtag 9000 

ttgagcgtgg tggcgatgtg ctcggtgacg aagaagtaca tgatccagcg gcggagcggc 9060 

atctcgctga cgtcgcccag ggcttccaag cgctccatgg cctcgtagaa gtccacggcg 9120 

aagttgaaaa actgggagtt gcgcgccgag acggtcaact cctcctccag aagacggatg 9180 

agctcagcga tggtggcgcg cacctcgcgc tcgaaggccc cggggggctc ctcttcttcc 9240 

atctcttcct cctccactaa catctcttct acttcctcct caggaggcgg cggcggggga 9300 

ggggccctgc gtcgccggcg gcgcacgggc agacggtcga tgaagcgctc gatggtctcc 9360 

ccgcgccggc gacgcatggt ctcggtgacg gcgcgcccgt cctcgcgggg ccgcagcgtg 9420 

aagacgccgc cgcgcatctc caggtggccg ccgggggggt ctccgttggg cagggagagg 9480 

gcgctgacga tgcatcttat caattggccc gtagggactc cgcgcaagga cctgagcgtc 9540 

tcgagatcca cgggatccga aaaccgctga acgaaggctt cgagccagtc gcagtcgcaa 9600 

ggtaggctga gcccggtttc ttgttcttcg gggatttcgg gaggcgggcg ggcgatgctg 9660 

ctggtgatga agttgaagta ggcggtcctg agacggcgga tggtggcgag gagcaccagg 9720 

tccttgggcc cggcttgctg gatgcgcaga cggtcggcca tgccccaggc gtggtcctga 9780 

cacctggcga ggtccttgta gtagtcctgc atgagccgct ccacgggcac ctcctcctcg 9840 

cccgcgcggc cgtgcatgcg cgtgagcccg aacccgcgct ggggctggac gagcgccagg 9900 

tcggcgacga cgcgctcggc gaggatggcc tgctgtatct gggtgagggt ggtctggaag 9960 

tcgtcgaagt cgacgaagcg gtggtaggct ccggtgttga tggtatagga gcagttggcc 10020 

atgacggacc agttgacggt ctggtggccg ggtcgcacga gctcgtggta cttgaggcgc 10080 
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gagtaggcgc 


gcgtgtcgaa 


gatgtagtcg 


ttgcaggtgc 


gcacgaggta 


ctggtatccg 


10140 


acgaggaagt 


gcggcggcgg 


ctggcggtag 


agcggccatc 


gctcggtggc 


gggggcgccg 


10200 


ggcgcgaggt 


cctcgagcat 


gaggcggtgg 


tagccgtaga 


tgtacctgga 


catccaggtg 


10260 


atgccggcgg 


cggtggtgga 


ggcgcgcggg 


aactcgcgga 


cgcggttcca 


gatgttgcgc 


10320 


agcggcagga 


agtagttcat 


ggtggccgcg 


gtctggcccg 


tgaggcgcgc 


gcagtcgtgg 


10380 


atgctctaga 


catacgggca 


aaaacgaaag 


cggtcagcgg 


ctcgactccg 


tggcctggag 


10440 


gctaagcgaa 


cgggttgggc 


tgcgcgtgta 


ccccggttcg 


aatctcgaat 


caggctggag 


10500 


ccgcagctaa 


cgtggtactg 


gcactcccgt 


ctcgacccaa 


gcctgctaac 


gaaacctcca 


10560 


ggatacggag 


gcgggtcgtt 


ttttggcctt 


ggtcgctggt 


catgaaaaac 


tagtaagcgc 


10620 


ggaaagcgac 


cgcccgcgat 


ggctcgctgc 


cgtagtctgg 


agaaagaatc 


gccagggttg 


10680 


cgttgcggtg 


tgccccggtt 


cgagcctcag 


cgctcggcgc 


cggccggatt 


ccgcggctaa 


10740 


cgtgggcgtg 


gctgccccgt 


cgtttccaag 


accccttagc 


cagccgactt 


ctccagttac 


10800 


ggagcgagcc 


cctctttttc 


ttgtgttttt 


gccagatgca 


tcccgtactg 


cggcagatgc 


10860 


gcccccaccc 


tccacctcaa 


ccgcccctac 


cgccgcagca 


gcagcaacag 


ccggcgcttc 


10920 


tgcccccgcc 


ccagcagcag 


ccagccacta 


ccgcggcggc 


cgccgtgagc 


ggagccggcg 


10980 


ttcagtatga 


cctggccttg 


gaagagggcg 


aggggctggc 


gcggctgggg 


gcgtcgtcgc 


11040 


cggagcggca 


cccgcgcgtg 


cagatgaaaa 


gggacgctcg 


cgaggcctac 


gtgcccaagc 


11100 


agaacctgtt 


cagagacagg 


agcggcgagg 


agcccgagga 


gatgcgcgcc 


tcccgcttcc 


11160 


acgcggggcg 


ggagctgcgg 


cgcggcctgg 


accgaaagcg 


ggtgctgagg 


gacgaggatt 


11220 


tcgaggcgga 


cgagctgacg 


gggatcagcc 


ccgcgcgcgc 


gcacgtggcc 


gcggccaacc 


11280 


tggtcacggc 


gtacgagcag 


accgtgaagg 


aggagagcaa 


cttccaaaaa 


tccttcaaca 


11340 


accacgtgcg 


cacgctgatc 


gcgcgcgagg 


aggtgaccct 


gggcctgatg 


cacctgtggg 


11400 


acctgctgga 


ggccatcgtg 


cagaacccca 


cgagcaagcc 


gctgacggcg 


cagctgtttc 


11460 


tggtggtgca 


gcacagtcgg 


gacaacgaga 


cgttcaggga 


ggcgctgctg 


aatatcaccg 


11520 


agcccgaggg 


ccgctggctc 


ctggacctgg 


tgaacattct 


gcagagcatc 


gtggtgcagg 


11580 


agcgcgggct 


gccgctgtcc 


gagaagctgg 


cggctatcaa 


cttctcggtg 


ctgagcctgg 


11640 


gcaagtacta 


cgctaggaag 


atctacaaga 


ccccgtacgt 


gcccatagac 


aaggaggtga 


11700 


agatcgacgg 


gttttacatg 


cgcatgaccc 


tgaaagtgct 


gaccctgagc 


gacgatctgg 


11760 
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gggtgtaccg caacgacagg atgcaccgcg cggtgagcgc cagccgccgg cgcgagctga 11820 
gcgaccagga gctgatgcac agcctgcagc gggccctgac cggggccggg accgaggggg 11880 
agagctactt tgacatgggc gcggacctgc gctggcagcc cagccgccgg gccttggaag 11940 
ctgccggcgg ttccccctac gtggaggagg tggacgatga ggaggaggag ggcgagtacc 12000 
tggaagactg atggcgcgac cgtatttttg ctagatgcag caacagccac cgcctcctga 12060 
tcccgcgatg cgggcggcgc tgcagagcca gccgtccggc attaactcct cggacgattg 12120 

gacccaggcc atgcaacgca tcatggcgct gacgacccgc aatcccgaag cctttagaca 12180 

gcagcctcag gccaaccggc tctcggccat cctggaggcc gtggtgccct cgcgctcgaa 1224 0 

ccccacgcac gagaaggtgc tggccatcgt gaacgcgctg gtggagaaca aggccatccg 12300 

cggcgacgag gccgggctgg tgtacaacgc gctgctggag cgcgtggccc gctacaacag 12360 

caccaacgtg cagacgaacc tggaccgcat ggtgaccgac gtgcgcgagg cggtgtcgca 12420 

gcgcgagcgg ttccaccgcg agtcgaacct gggctccatg gtggcgctga acgccttcct 12480 

gagcacgcag cccgccaacg tgccccgggg ccaggaggac tacaccaact tcatcagcgc 12540 

gctgcggctg atggtggccg aggtgcccca gagcgaggtg taccagtcgg ggccggacta 12600 

cttcttccag accagtcgcc agggcttgca gaccgtgaac ctgagccagg ctttcaagaa 12660 

cttgcaggga ctgtggggcg tgcaggcccc ggtcggggac cgcgcgacgg tgtcgagcct 12720 

gctgacgccg aactcgcgcc tgctgctgct gctggtggcg cccttcacgg acagcggcag 12780 

cgtgagccgc gactcgtacc tgggctacct gcttaacctg taccgcgagg ccatcgggca 12840 

ggcgcacgtg gacgagcaga cctaccagga gatcacccac gtgagccgcg cgctgggcca 12900 

ggaggacccg ggcaacctgg aggccaccct gaacttcctg ctgaccaacc ggtcgcagaa 12960 

gatcccgccc cagtacgcgc tgagcaccga ggaggagcgc atcctgcgct acgtgcagca 13020 

gagcgtgggg ctgttcctga tgcaggaggg ggccacgccc agcgccgcgc tcgacatgac 13080 

cgcgcgcaac atggagccca gcatgtacgc tcgcaaccgc ccgttcatca ataagctgat 13140 

ggactacttg catcgggcgg ccgccatgaa ctcggactac tttaccaacg ccatcttgaa 13200 

cccgcactgg ctcccgccgc ccgggttcta cacgggcgag tacgacatgc ccgaccccaa 13260 

cgacgggttc ctgtgggacg acgtggacag cagcgtgttc tcgccgcgcc ccgccaccac 13320 

cgtgtggaag aaagagggcg gggaccggcg gccgtcctcg gcgctgtccg gtcgcgcggg 13380 

tgctgccgcg gcggtgcctg aggccgccag ccccttcccg agcctgccct tttcgctgaa 13440 

cagcgtgcgc agcagcgagc tgggtcggct gacgcggccg cgcctgctgg gcgaggagga 13500 
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gtacctgaac 


gactccttgt tgaggcccga gcgcgagaag 


aacttcccca 


ataacgggat 


13560 


agagagcctg 


gtggacaaga tgagccgctg gaagacgtac 


gcgcacgagc 


acagggacga 


13620 


gccccgagct 


agcagcagcg caggcacccg tagacgccag 


cgacacgaca 


ggcagcgggg 


13680 


tctggtgtgg 


gacgatgagg attccgccga cgacagcagc 


gtgttggact 


tgggtgggag 


13740 


tggtggtggt 


aacccgttcg ctcacttgcg cccccgtatc 


gggcgcctga 


tgtaagaatc 


13800 


tgaaaaaata 


aaaaacggta ctcaccaagg ccatggcgac 


cagcgtgcgt 


tcttctctgt 


13860 


tgtttgtagt 


agt atg atg agg cgc gtg tac ccg gag ggt cct 
Met Met Arg Arg Val Tyr Pro Glu Gly Pro 
1 5 10 


cct ccc 
Pro Pro 


13909 



teg tac gag age gtg atg cag cag gcg gtg gcg gcg gcg atg cag ccc 
Ser Tyr Glu Ser Val Met Gin Gin Ala Val Ala Ala Ala Met Gin Pro 
15 20 25 



13957 



ccg ctg gag gcg cct tac gtg ccc ccg egg tac ctg gcg cct acg gag 
Pro Leu Glu Ala Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu 
30 35 40 



14005 



ggg egg aac age att cgt tac teg gag ctg gca ccc ttg tac gat acc 
Gly Arg Asn Ser lie Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr 
45 50 55 60 



14053 



acc egg ttg tac ctg gtg gac aac aag teg gcg gac ate gee teg ctg 
Thr Arg Leu Tyr Leu Val Asp Asn Lys Ser Ala Asp lie Ala Ser Leu 
65 70 75 



14101 



aac tac cag aac gac cac age aac ttc ctg acc acc gtg gtg cag aac 
Asn Tyr Gin Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gin Asn 
80 85 90 



14149 



aac gat ttc acc ccc acg gag gee age acc cag acc ate aac ttt gac 
Asn Asp Phe Thr Pro Thr Glu Ala Ser Thr Gin Thr lie Asn Phe Asp 
95 100 105 



14197 



gag cgc teg egg tgg ggc ggc cag ctg aaa acc ate atg cac acc aac 
Glu Arg Ser Arg Trp Gly Gly Gin Leu Lys Thr lie Met His Thr Asn 
110 115 120 



14245 



atg ccc aac gtg aac gag ttc atg tac age aac aag ttc aag gcg egg 
Met Pro Asn Val Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg 
125 130 135 "* 140 



14293 



gtg atg gtc teg cgc aag acc ccc aat ggg gtc gcg gtg gat gag aat 
Val Met Val Ser Arg Lys Thr Pro Asn Gly Val Ala Val Asp Glu Asn 
145 150 155 



14341 



tat gat ggt agt cag gac gag ctg act tac gag tgg gtg gag ttt gag 
Tyr Asp Gly Ser Gin Asp Glu Leu Thr Tyr Glu Trp Val Glu Phe Glu 
160 165 170 



14389 
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ctg ccc gag ggc aac ttc teg gtg acc atg ace ate gat ctg atg aac 14437 
Leu Pro Glu Gly Asn Phe Ser Val Thr Met Thr lie Asp Leu Met Asn 
175 180 185 

aac gcc ate ate gac aac tac ttg gcg gtg ggg cgt cag aac ggg gtg 14485 
Asn Ala lie lie Asp Asn Tyr Leu Ala Val Gly Arg Gin Asn Gly Val 
190 195 200 

ctg gag age gac ate ggc gtg aag ttc gac acg cgc aac ttc egg ctg 14533 
Leu Glu Ser Asp He Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu 
205 210 215 " 220 

ggc tgg gac ccc gtg acc gag ctg gtg atg ccg ggc gtg tac acc aac 14581 
Gly Trp Asp Pro Val Thr Glu Leu Val Met Pro Gly Val Tyr Thr Asn 
225 230 235 

gag gcc ttc cac ccc gac ate gtc ctg ctg ccc ggc tgc ggc gtg gac 14629 
Glu Ala Phe His Pro Asp He Val Leu Leu Pro Gly Cys Gly Val Asp 
240 245 250 

ttc acc gag age cgc etc age aac ctg ctg ggc ate cgc aag egg cag 14677 
Phe Thr Glu Ser Arg Leu Ser Asn Leu Leu Gly He Arg Lys Arg Gin 
255 260 265 

ccc ttc cag gag ggc ttc cag ate ctg tac gag gac ctg gag ggg ggc 14725 
Pro Phe Gin Glu Gly Phe Gin He Leu Tyr Glu Asp Leu Glu Gly Gly 
270 275 " 280 

aac ate ccc gcg etc ttg gat gtc gaa gcc tat gag aaa age aag gag 14773 
Asn He Pro Ala Leu Leu Asp Val Glu Ala Tyr Glu Lys Ser Lys Glu 
285 290 295 300 

gag gcc gcc gca gcg gcg acc gca gcc gtg gcc acc gcc tct acc gag 14821 
Glu Ala Ala Ala Ala Ala Thr Ala Ala Val Ala Thr Ala Ser Thr Glu 
305 310 315 

gtg egg ggc gat aat ttt get age gcc gcg gca gtg gcc gag gcg get 14869 
Val Arg Gly Asp Asn Phe Ala Ser Ala Ala Ala Val Ala Glu Ala Ala 
320 325 330 

gaa acc gaa agt aag ata gtc ate cag ccg gtg gag aag gac age aag 14917 
Glu Thr Glu Ser Lys He Val He Gin Pro Val Glu Lys Asp Ser Lys 
335 340 3.45 

gac agg age tac aac gtg etc gcg gac aag aaa aac acc gcc tac cgc 14965 
Asp Arg Ser Tyr Asn Val Leu Ala Asp Lys Lys Asn Thr Ala Tyr Arg 
350 355 360 

age tgg tac ctg gcc tac aac tac ggc gac ccc gag aag ggc gtg cgc 15013 
Ser Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg 
365 370 375 " "* 380 

tec tgg acg ctg etc acc acc teg gac gtc acc tgc ggc gtg gag caa 15061 
Ser Trp Thr Leu Leu Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gin 
385 390 ~ ^ 395 
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gtc tac tgg teg ctg ccc gac atg atg caa gac ccg gtc acc ttc cgc 15109 
Val Tyr Trp Ser Leu Pro Asp Met Met Gin Asp Pro Val Thr Phe Arg 
400 405 410 

tec acg cgt caa gtt age aac tac ccg gtg gtg ggc gec gag etc ctg 15157 
Ser Thr Arg Gin Val Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Leu 
415 420 425 

ccc gtc tac tec aag age ttc ttc aac gag cag gec gtc tac teg cag 15205 
Pro Val Tyr Ser Lys Ser Phe Phe Asn Glu Gin Ala Val Tyr Ser Gin 
430 435 440 

cag ctg cgc gee ttc acc teg etc acg cac gtc ttc aac cgc ttc ccc 15253 
Gin Leu Arg Ala Phe Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro 
445 450 455 460 

gag aac cag ate etc gtc cgc ccg ccc gcg ccc acc att acc acc gtc 15301 
Glu Asn Gin He Leu Val Arg Pro Pro Ala Pro Thr He Thr Thr Val 
465 470 475 

agt gaa aac gtt cct get etc aca gat cac ggg acc ctg ccg ctg cgc 15349 
Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg 
480 485 490 

age agt ate egg gga gtc cag cgc gtg acc gtc act gac gee aga cgc 15397 
Ser Ser lie Arg Gly Val Gin Arg Val Thr Val Thr Asp Ala Arg Arg 
495 500 505 



cgc acc tgc ccc tac gtc tac aag gee ctg ggc gta gtc gcg ccg cgc 
Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Val Val Ala Pro Arq 
510 515 520 



15445 



gtc etc teg age cgc acc ttc taa aaaatgtcca ttctcatctc geccagtaat 15499 
Val Leu Ser Ser Arg Thr Phe 
525 530 

aacaceggtt ggggcctgcg cgcgcccagc aagatgtacg gaggegcteg ccaacgctcc 15559 

acgcaacacc ccgtgcgcgt gcgcgggcac ttccgcgctc cctggggcgc cctcaagggc 15619 

cgcgtgcgct cgcgcaccac cgtcgacgac gtgatcgacc aggtggtggc cgacgcgcgc 15679 

aactacacgc ccgccgccgc gcccgcctcc accgtggacg ccgtcatcga cagcgtggtg 15739 

gccgatgcgc gccggtacgc ccgcgccaag agccggcggc ggcgcatcgc ccggcggcac 15799 

cggagcaccc ccgccatgcg cgcggcgcga gccttgctgc geagggecag gcgcacggga 15859 

cgcagggcca tgetcaggge ggccagacgc gcggcctccg gcagcagcag cgccggcagg 15919 

acccgcagac gcgcggccac ggcggcggcg gcggccatcg ccagcatgtc ccgcccgcgg 15979 

cgcggcaacg tgtactgggt gcgcgacgcc gccaccggtg tgcgcgtgcc cgtgcgcacc 16039 

cgcccccctc gcacttgaag atgetgaett cgcgatgttg atgtgtccca gcggcgagga 16099 

ggatgtccaa gegcaaatae aaggaagaga tgctccaggt catcgcgcct gagatctacg 16159 



80 



WO 03/046124 



PCT/US02/33645 



gccccgcggt 


gaaggaggaa 


agaaagcccc 


gcaaactgaa 


gcgggtcaaa 


aaggacaaaa 


16219 


aggaggagga 


agatgtggac 


ggactggtgg 


agtttgtgcg 


cgagttcgcc 


ccccggcggc 


16279 


gcgtgcagtg 


gcgcgggcgg 


aaagtgaaac 


cggtgctgcg 


gcccggcacc 


acggtggtct 


16339 


tcacgcccgg 


cgagcgttcc 


ggctccgcct 


ccaagcgctc 


ctacgacgag 


gtgtacgggg 


16399 


acgaggacat 


cctcgagcag 


gcggtcgagc 


gtctgggcga 


gtttgcttac 


ggcaagcgca 


16459 


gccgccccgc 


gcccttgaaa 


gaggaggcgg 


tgtccatccc 


gctggaccac 


ggcaacccca 


16519 


cgccgagcct 


gaagccggtg 


accctgcagc 


aggtgctgcc 


gagcgcggcg 


ccgcgccggg 


16579 


gcttcaagcg 


cgagggcggc 


gaggatctgt 


acccgaccat 


gcagctgatg 


gtgcccaagc 


16639 


gccagaagct 


ggaggacgtg 


ctggagcaca 


tgaaggtgga 


ccccgaggtg 


cagcccgagg 


16699 


tcaaggtgcg 


gcccatcaag 


caggtggccc 


cgggcctggg 


cgtgcagacc 


gtggacatca 


16759 


agatccccac 


ggagcccatg 


gaaacgcaga 


ccgagcccgt 


gaagcccagc 


accagcacca 


16819 


tggaggtgca 


gacggatccc 


tggatgccgg 


cgccggcttc 


caccactcgc 


cgaagacgca 


16879 


agtacggcgc 


ggccagcctg 


ctgatgccca 


actacgcgct 


gcatccttcc 


atcatcccca 


16939 


cgccgggcta 


ccgcggcacg 


cgcttctacc 


gcggctacac 


cagcagccgc 


cgcaagacca 


16999 


ccacccgccg 


ccgccgtcgt 


cgcacccgcc 


gcagcagcac 


cgcgacttcc 


gccgccgccc 


17059 


tggtgcggag 


agtgtaccgc 


agcgggcgcg 


agcctctgac 


cctgccgcgc 


gcgcgctacc 


17119 


acccgagcat 


cgccatttaa 


ctctgccgtc 


gcctcctact 


tgcagatatg 


gccctcacat 


17179 


gccgcctccg 


cgtccccatt 


acgggctacc 


gaggaagaaa 


gccgcgccgt 


agaaggctga 


17239 


cggggaacgg 


gctgcgtcgc 


catcaccacc 


ggcggcggcg 


cgccatcagc 


aagcggttgg 


17299 


ggggaggctt 


cctgcccgcg 


ctgatcccca 


tcatcgccgc 


ggcgatcggg 


gcgatccccg 


17359 


gcatagcttc 


cgtggcggtg 


caggcctctc 


agcgccactg 


agacacagct 


tggaaaattt 


17419 


gtaataaaaa 


aatggactga 


cgctcctggt 


cctgtgatgt 


gtgtttttag 


atggaagaca 


17479 


tcaatttttc 


gtccctggca 


ccgcgacacg 


gcacgcggcc 


gtttatgggc 


acctggagcg 


17539 


acatcggcaa 


cagccaactg 


aacgggggcg 


ccttcaattg 


gagcagtctc 


tggagcgggc 


17599 


ttaagaattt 


cgggtccacg 


ctcaaaacct 


atggcaacaa 


ggcgtggaac 


agcagcacag 


17659 


ggcaggcgct 


gagggaaaag 


ctgaaagagc 


agaacttcca 


gcagaaggtg 


gtcgatggcc 


17719 


tggcctcggg 


catcaacggg 


gtggtggacc 


tggccaacca 


ggccgtgcag 


aaacagatca 


17779 


acagccgcct 


ggacgcggtc 


ccgcccgcgg 


ggtccgtgga 


gatgccccag 


gtggaggagg 


17839 
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agctgcctcc 


cctggacaag 


cacaacaaca 


aacaaccaccr 


tcccgacgcg 


aaaoaaacac 


17899 


tgctgacgca 


cacaaacaaa 


ccgcccccgt 


acgaggaggc 


acf taaaacta 


ggt ctgccca 


17959 


ccacgcggcc 


ccrtcraccrcct 


ctggccaccg 


gggtgctgaa 


acccagcagc 


acrcacrccacrc 


18019 


ccgcgaccct 


ggacttgcct 


cccfcctactt 


cccacccctc 


cacaataoct 


aacrcccctcrc 


18079 


caccaataac 


ccrtcacatca 


cacacccccc 


oacrcrcccrccc 


cca.aac.es a. a c 


tcrcrca ciacrca 


18139 


ctctgaacag 


catcgtgggt 


ctgggagtgc 


agagtgtgaa 


gcgccgccgc 


tgctattaaa 


18199 


agacactgta 


gcgcttaact 


tgcttgtctg 


tgtgtatatg 


tatgtccgcc 


gaccagaagg 


18259 


aggaagaggc 


gcgtcgccga 


gttgcaag atg gcc acc cca teg atg 


ctg ccc 


18311 



Met Ala Thr Pro Ser Met Leu Pro 
535 



cag tgg gcg tac atg cac ate gcc gga cag gac get teg gag tac ctg 
Gin Trp Ala Tyr Met His lie Ala Gly Gin Asp Ala Ser Glu Tyr Leu 
540 545 550 555 



18359 



agt ccg ggt ctg gtg cag ttc gcc cgc gcc aca gac acc tac ttc agt 
Ser Pro Gly Leu Val Gin Phe Ala Arg Ala Thr Asp Thr Tyr Phe Ser 
560 565 570 



18407 



ctg ggg aac aag ttt agg aac ccc acg gtg gcg ccc acg cac gat gtg 
Leu Gly Asn Lys Phe Arg Asn Pro Thr Val Ala Pro Thr His Asp Val 
575 580 585 



18455 



acc acc gac cgc age cag egg ctg acg ctg cgc ttc gtg ccc gtg gac 
Thr Thr Asp Arg Ser Gin Arg Leu Thr Leu Arg Phe Val Pro Val Asp 
590 595 600 



18503 



cgc gag gac aac acc tac teg tac aaa gtg cgc tac acg ctg gcc gtg 
Arg Glu Asp Asn Thr Tyr Ser Tyr Lys Val Arg Tyr Thr Leu Ala Val 
605 610 615 



18551 



ggc gac aac cgc gtg ctg gac atg gcc age acc tac ttt gac ate cgc 
Gly Asp Asn Arg Val Leu Asp Met Ala Ser Thr Tyr Phe Asp He Arg 
620 625 630 635 



18599 



ggc gtg ctg gat egg ggg ccc age ttc aaa ccc tac tec ggc acc gcc 
Gly Val Leu Asp Arg Gly Pro Ser Phe Lys Pro Tyr Ser Gly Thr Ala 
640 645 " 650 



18647 



tac aac age ctg get ccc aag gga gcg ccc aac act tgc cag tgg aca 
Tyr Asn Ser Leu Ala Pro Lys Gly Ala Pro Asn Thr Cys Gin Trp Thr 
655 " 660 665 



18695 



tat aaa get ggt gat act gat aca gaa aaa acc tat aca tat gga aat 
Tyr Lys Ala Gly Asp Thr Asp Thr Glu Lys Thr Tyr Thr Tyr Gly Asn 
670 675 680 



18743 



gca cct gtg caa ggc att age att aca aag gat ggt att caa ctt gga 
Ala Pro Val Gin Gly He Ser He Thr Lys Asp Gly He Gin Leu Gly 
685 690 695 



18791 
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act gac age gat ggt cag gca ate tat gca gac gaa act tat caa cca 18839 
Thr Asp Ser Asp Gly Gin Ala lie Tyr Ala Asp Glu Thr Tyr Gin Pro 
700 705 710 715 

gag cct caa gtg ggt gat get gaa tgg cat gac ate act ggt act gat 18887 
Glu Pro Gin Val Gly Asp Ala Glu Trp His Asp lie Thr Gly Thr Asp 
720 725 730 

gaa aaa tat gga ggc aga get ctt aag cct gac ace aaa atg aag cct 18935 
Glu Lys Tyr Gly Gly Arg Ala Leu Lys Pro Asp Thr Lys Met Lys Pro 
735 740 745 

tgc tat ggt tct ttt gee aag cct ace aat aaa gaa gga ggc cag gca 18983 
Cys Tyr Gly Ser Phe Ala Lys Pro Thr Asn Lys Glu Gly Gly Gin Ala 
750 755 760 

aat gtg aaa ace gaa aca ggc ggt acc aaa gaa tat gac att gac atg 19031 
Asn Val Lys Thr Glu Thr Gly Gly Thr Lys Glu Tyr Asp lie Asp Met 
765 770 775 

gca ttc ttc gat aat cga agt gca get gee gec ggc eta gec cca gaa 19079 
Ala Phe Phe Asp Asn Arg Ser Ala Ala Ala Ala Gly Leu Ala Pro Glu 
780 785 790 795 

att gtt ttg tat act gag aat gtg gat ctg gaa act cca gat acc cat 19127 
lie Val Leu Tyr Thr Glu Asn Val Asp Leu Glu Thr Pro Asp Thr His 
800 805 *" 810 

att gta tac aag gca ggt aca gat gac agt age tct tct ate aat ttg 19175 
lie Val Tyr Lys Ala Gly Thr Asp Asp Ser Ser Ser Ser lie Asn Leu 
815 820 825 

ggt cag cag tec atg ccc aac aga ccc aac tac att ggc ttc aga gac 19223 
Gly Gin Gin Ser Met Pro Asn Arg Pro Asn Tyr lie Gly Phe Arg Asp 
830 835 840 

aac ttt ate ggt ctg atg tac tac aac age act ggc aat atg ggt gta 19271 
Asn Phe He Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val 
845 850 855 

ctg get gga cag gee tec cag ctg aat get gtg gtg gac ttg cag gac 19319 
Leu Ala Gly Gin Ala Ser Gin Leu Asn Ala Val Val Asp Leu Gin Asp 
860 865 870 875 

aga aac acc gaa ctg tec tac cag etc ttg ctt gac tct ctg ggt gac 19367 
Arg Asn Thr Glu Leu Ser Tyr Gin Leu Leu Leu Asp Ser Leu Gly Asp 
880 885 890 

aga acc agg tat ttc agt atg tgg aat cag gcg gtg gac agt tat gac 19415 
Arg Thr Arg Tyr Phe Ser Met Trp Asn Gin Ala Val Asp Ser Tyr Asp 
895 900 905 

ccc gat gtg cgc att att gaa aat cac ggt gtg gag gat gaa ctt cct 19463 
Pro Asp Val Arg He He Glu Asn His Gly Val Glu Asp Glu Leu Pro 
910 915 920 
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aac tat tgc ttc ccc ctg gat get gtg ggt aga act gat act tac cag 
Asn Tyr Cys Phe Pro Leu Asp Ala Val Gly Arg Thr Asp Thr Tyr Gin 
925 930 935 



19511 



gga att aag gec aat ggt gat aat caa acc acc tgg acc aaa gat gat 
Gly lie Lys Ala Asn Gly Asp Asn Gin Thr Thr Trp Thr Lys Asp Asp 
940 945 950 955 



19559 



act gtt aat gat get aat gaa ttg ggc aag ggc aat cct ttc gec atg 
Thr Val Asn Asp Ala Asn Glu Leu Gly Lys Gly Asn Pro Phe Ala Met 
960 " 965 970 



19607 



gag ate aac ate cag gec aac ctg tgg egg aac ttc etc tac gcg aac 
Glu lie Asn lie Gin Ala Asn Leu Trp Arg Asn Phe Leu Tyr Ala Asn 
975 980 985 



19655 



gtg gcg ctg tac ctg ccc gac tec tac aag tac acg ccg gec aac ate 
Val Ala Leu Tyr Leu Pro Asp Ser Tyr Lys Tyr Thr Pro Ala Asn lie 
990 995 "* 1000 



19703 



acg ctg ccc acc aac acc aac acc tac gat tac atg aac ggc cgc 
Thr Leu Pro Thr Asn Thr Asn Thr Tyr Asp Tyr Met Asn Gly Arg 
1005 1010 1015 



19748 



gtg gtg gcg ccc teg ctg gtg gac gee tac ate aac ate ggg gcg 
Val Val Ala Pro Ser Leu Val Asp Ala Tyr He Asn He Gly Ala 
1020 1025 1030 



19793 



cgc tgg teg ctg gac ccc atg gac aac gtc aac ccc ttc aac cac 
Arg Trp Ser Leu Asp Pro Met Asp Asn Val Asn Pro Phe Asn His 
1035 1040 1045 



19838 



cac cgc aac gcg ggc ctg cga tac cgc tec atg etc ctg ggc aac 
His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu Leu Gly Asn 
1050 1055 1060 



19883 



ggg cgc tac gtg ccc ttc cac ate cag gtg ccc caa aag ttt ttc 
Gly Arg Tyr Val Pro Phe His He Gin Val Pro Gin Lys Phe Phe 
1065 1070 1075 



19928 



gee ate aag age etc ctg etc ctg ccc ggg tec tac acc tac gag 
Ala He Lys Ser Leu Leu Leu Leu Pro Gly Ser Tyr Thr Tyr Glu 
1080 1085 ~ 1090 



19973 



tgg aac ttc cgc aag gac gtc aac atg ate ctg cag age tec etc 
Trp Asn Phe Arg Lys Asp Val Asn Met He Leu Gin Ser Ser Leu 
1095 1100 1105 



20018 



ggc aac gac ctg cgc acg gac ggg gec tec ate gee ttc acc age 
Gly Asn Asp Leu Arg Thr Asp Gly Ala Ser He Ala Phe Thr Ser 
1110 1115 1120 



20063 



ate aac etc tac gee acc ttc ttc ccc atg gcg cac aac acc gee 
He Asn Leu Tyr Ala Thr Phe Phe Pro Met Ala His Asn Thr Ala 
1125 1130 1135 



20108 
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tec acg etc gag gec atg ctg cgc aac gac acc aac gac cag tec 
Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gin Ser 
1140 1145 1150 



20153 



ttc aac gac tac etc teg gcg gec aac atg etc tac ccc ate ccg 
Phe Asn Asp Tyr Leu Ser Ala Ala Asn Met Leu Tyr Pro lie Pro 
1155 1160 1165 



20198 



gec aac gee acc aac gtg ccc ate tec ate ccc teg cgc aac tgg 
Ala Asn Ala Thr Asn Val Pro lie Ser lie Pro Ser Arg Asn Trp 
1170 1175 1180 



20243 



gec gee ttc cgc ggc tgg tec ttc acg cgc etc aag acc cgc gag 
Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg Leu Lys Thr Arg Glu 
1185 1190 1195 



20288 



acg ccc teg etc ggc tec ggg ttc gac ccc tac ttc gtc tac teg 
Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser 
1200 1205 1210 



20333 



ggc tec ate ccc tac etc gac ggc acc ttc tac etc aac cac acc 
Gly Ser He Pro Tyr Leu Asp Gly Thr Phe Tyr Leu Asn His Thr 
1215 1220 ~ 1225 



20378 



ttc aag aag gtc tec ate acc ttc gac tec tec gtc age tgg ccc 
Phe Lys Lys Val Ser He Thr Phe Asp Ser Ser Val Ser Trp Pro 
1230 1235 1240 



20423 



ggc aac gac cgc etc ctg acg ccc aac gag ttc gaa ate aag cgc 
Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu He Lys Arg 
1245 " 1250 1255 



20468 



acc gtc gac gga gag ggg tac aac gtg gee cag tgc aac atg acc 
Thr Val Asp Gly Glu Gly Tyr Asn Val Ala Gin Cys Asn Met Thr 
1260 1265 1270 



20513 



aag gac tgg ttc ctg gtc cag atg ctg gee cac tac aac ate ggc 
Lys Asp Trp Phe Leu Val Gin Met Leu Ala His Tyr Asn He Gly 
1275 1280 1285 



20558 



tac cag ggc ttc tac gtg ccc 
Tyr Gin Gly Phe Tyr Val Pro 
1290 " 1295 



gag ggc tac aag gac cgc atg tac 
Glu Gly Tyr Lys Asp Arg Met Tyr 
1300 



20603 



tec ttc ttc cgc aac ttc cag ccc atg age cgc cag gtc gtg gac 
Ser Phe Phe Arg Asn Phe Gin Pro Met Ser Arg Gin Val Val Asp 
1305 1310 1315 



20648 



gag gtc aac tac aag gac tac cag gec gtc acc ctg gee tac cag 
Glu Val Asn Tyr Lys Asp Tyr Gin Ala Val Thr Leu Ala Tyr Gin 
1320 1325 1330 



20693 



cac aac aac teg ggc ttc gtc ggc tac etc gcg ccc acc atg cgc 
His Asn Asn Ser Gly Phe Val Gly Tyr Leu Ala Pro Thr Met Arg 
1335 1340 1345 



20738 
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cag ggc cag ccc tac ccc gcc aac tac ccc tac ccg etc ate ggc 20783 

Gin Gly Gin Pro Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu lie Gly 
1350 ^ 1355 1360 

aag age gcc gtc gcc age gtc acc cag aaa aag ttc etc tgc gac 20828 

Lys Ser Ala Val Ala Ser Val Thr Gin Lys Lys Phe Leu Cys Asp 
1365 1370 1375 

egg gtc atg tgg cgc ate ccc ttc tec age aac ttc atg tec atg 20873 

Arg Val Met Trp Arg lie Pro Phe Ser Ser Asn Phe Met Ser Met 
1380 1385 1390 

ggc gcg etc acc gac etc ggc cag aac atg etc tac gcc aac tec 20918 

Gly Ala Leu Thr Asp Leu Gly Gin Asn Met Leu Tyr Ala Asn Ser 
1395 1400 1405 

gcc cac gcg eta gac atg aat ttc gaa gtc gac ccc atg gat gag 20963 

Ala His Ala Leu Asp Met Asn Phe Glu Val Asp Pro Met Asp Glu 
1410 1415 1420 

tec acc ctt etc tat gtt gtc ttc gaa gtc ttc gac gtc gtc cga 21008 

Ser Thr Leu Leu Tyr Val Val Phe Glu Val Phe Asp Val Val Arg 
1425 1430 1435 

gtg cac cag ccc cac cgc ggc gtc ate gag gcc gtc tac ctg cgc 21053 

Val His Gin Pro His Arg Gly Val lie Glu Ala Val Tyr Leu Arg 
1440 1445 1450 



acg ccc ttc teg gcc ggc aac gcc acc acc taa gcctcttgct 21096 
Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr 
1455 1460 



tettgeaaga 


tgacggcctg 


cgcgggctcc 


ggegagcagg 


agctcagggc 


catcctccgc 


21156 


gacctgggct 


gcgggccctg 


cttcctgggc 


accttcgaca 


agcgcttccc 


gggattcatg 


21216 


gccccgcaca 


agctggcctg 


cgccatcgtc 


aacacggccg 


gccgcgagac 


egggggegag 


21276 


cactggctgg 


ccttcgcctg 


gaacccgcgc 


tcccacacct 


gctacctctt 


cgaccccttc 


21336 


gggttctcgg 


acgagcgcct 


caagcagatc 


taccagttcg 


agtacgaggg 


cctgctgcgt 


21396 


cgcagcgccc 


tggccaccga 


ggaccgctgc 


gtcaccctgg 


aaaagtccac 


ccagaccgtg 


21456 


cagggtccgc 


gctcggccgc 


ctgcgggctc 


ttctgetgea 


tgttcctgca 


cgccttcgtg 


21516 


cactggcccg 


accgccccat 


ggacaagaac 


cccaccatga 


acttgetgae 


gggggtgccc 


21576 


aacggcatgc 


tccagtcgcc 


ccaggtggaa 


cccaccctgc 


gccgcaacca 


ggaggegetc 


21636 


taccgcttcc 


tcaacgccca 


ctccgcctac 


tttcgctccc 


accgcgcgcg 


catcgagaag 


21696 


gccaccgcct 


tcgaccgcat 


gaatcaagac 


atgtaatccg 


gtgtgtgtat 


gtgaatgett 


21756 


tattcatcat 


aataaacagc 


acatgtttat 


gccaccttct 


ctgaggctct 


gactttattt 


21816 


agaaatcgaa 


ggggttctgc 


cggctctcgg 


catggcccgc 


gggcagggat 


acgttgcgga 


21876 
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actggtactt gggcagccac ttgaactcgg ggatcagcag cttcggcacg gggaggtcgg 21936 
ggaacgagtc gctccacagc ttgcgcgtga gttgcagggc gcccagcagg tcgggcgcgg 21996 
agatcttgaa atcgcagttg ggacccgcgt tctgcgcgcg agagttacgg tacacggggt 22056 
tgcagcactg gaacaccatc agggccgggt gcttcacgct cgccagcacc gtcgcgtcgg 22116 
tgatgccctc cacgtccaga tcctcggcgt tggccatccc gaagggggtc atcttgcagg 22176 
tctgccgccc catgctgggc acgcagccgg gcttgtggtt gcaatcgcag tgcaggggga 22236 
tcagcatcat ctgggcctgc tcggagctca tgcccgggta catggccttc atgaaagcct 22296 

ccagctggcg gaaggcctgc tgcgccttgc cgccctcggt gaagaagacc ccgcaggact 22356 

tgctagagaa ctggttggtg gcgcagccag cgtcgtgcac gcagcagcgc gcgtcgttgt 22416 

tggccagctg caccacgctg cgcccccagc ggttctgggt gatcttggcc cggtcggggt 22476 

tctccttcag cgcgcgctgc ccgttctcgc tcgccacatc catctcgatc gtgtgctcct 22536 

tctggatcat cacggtcccg tgcaggcacc gcagcttgcc ctcggcctcg gtgcacccgt 22596 

gcagccacag cgcgcagccg gtgctctccc agttcttgtg ggcgatctgg gagtgcgagt 22656 

gcacgaagcc ctgcaggaag cggcccatca tcgtggtcag ggtcttgttg ctggtgaagg 22716 

tcagcggaat gccgcggtgc tcctcgttca catacaggtg gcagatacgg cggtacacct 22776 

cgccctgctc gggcatcagc tggaaggcgg acttcaggtc gctctccacg cggtaccggt 22836 

ccatcagcag cgtcatcact tccatgccct tctcccaggc cgaaacgatc ggcaggctca 22896 

gggggttctt caccgttgtc atcttagtcg ccgccgccga agtcaggggg tcgttctcgt 22956 

ccagggtctc aaacactcgc ttgccgtcct tctcggtgat gcgcacgggg ggaaagctga 23016 

agcccacggc cgccagctcc tcctcggcct gcctttcgtc ctcgctgtcc tggctgatgt 23076 

cttgcaaagg cacatgcttg gtcttgcggg gtttcttttt gggcggcaga ggcggcggcg 23136 

gagacgtgct gggcgagcgc gagttctcgc tcaccacgac tatttcttct ccttggccgt 23196 

cgtccgagac cacgcggcgg taggcatgcc tcttctgggg cagaggcgga ggcgacgggc 23256 

tctcgcggtt cggcgggcgg ctggcagagc cccttccgcg ttcgggggtg cgctcctggc 23316 

ggcgctgctc tgactgactt cctccgcggc cggccattgt gttctcctag ggagcaagca 23376 

tggagactca gccatcgtcg ccaacatcgc catctgcccc cgccgccgcc gacgagaacc 23436 

agcagcagca gaatgaaagc ttaaccgccc cgccgcccag ccccacctcc gacgccgcag 23496 

ccccagacat gcaagagatg gaggaatcca tcgagattga cctgggctac gtgacgcccg 23556 
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attttatctt ctatcag atg tec aaa aag cgc gtc egg gtg gat gat gac 32126 

Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp 
1465 1470 

ttc gac ccc gtc tac ccc tac gat gca gac aac gca ccg acc gtg 32171 
Phe Asp Pro Val Tyr Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val 
1475 ~ 1480 ~ 1485 

ccc ttc ate aac ccc ccc ttc gtc tct tea gat gga ttc caa gag 32216 
Pro Phe lie Asn Pro Pro Phe Val Ser Ser Asp Gly Phe Gin Glu 
1490 1495 1500 

aag ccc ctg ggg gtg ctg tec ctg cga ctg get gac ccc gtc acc 32261 
Lys Pro Leu Gly Val Leu Ser Leu Arg Leu Ala Asp Pro Val Thr 
1505 1510 1515 

acc aag aac ggg gaa ate acc etc aag ctg gga gag ggg gtg gac 32306 
Thr Lys Asn Gly Glu lie Thr Leu Lys Leu Gly Glu Gly Val Asp 
1520 1525 " 1530 

etc gac tec teg gga aaa etc ate tec aac acg gee acc aag gee 32351 
Leu Asp Ser Ser Gly Lys Leu lie Ser Asn Thr Ala Thr Lys Ala 
1535 1540 1545 

gee gec cct etc agt ttt tec aac aac acc att tec ctt aac atg 32396 
Ala Ala Pro Leu Ser Phe Ser Asn Asn Thr lie Ser Leu Asn Met 
1550 1555 1560 

gat acc cct ctt tat acc aaa gat gga aaa tta tec tta caa gtt 32441 
Asp Thr Pro Leu Tyr Thr Lys Asp Gly Lys Leu Ser Leu Gin Val 
1565 1570 1575 

tct cca ccg tta aac ata tta aaa tea acc att ctg aac aca tta 3248 6 

Ser Pro Pro Leu Asn lie Leu Lys Ser Thr lie Leu Asn Thr Leu 
1580 1585 1590 

get gta get tat gga tea ggt tta gga ctg agt ggt ggc act get 32531 
Ala Val Ala Tyr Gly Ser Gly Leu Gly Leu Ser Gly Gly Thr Ala 
1595 1600 1605 

ctt gca gta cag ttg gec tct cca etc act ttt gat gaa aaa gga 32576 
Leu Ala Val Gin Leu Ala Ser Pro Leu Thr Phe Asp Glu Lys Gly 
1610 1615 1620 

aat att aaa att aac eta gec agt ggt cca tta aca gtt gat gca 32621 
Asn lie Lys lie Asn Leu Ala Ser Gly Pro Leu Thr Val Asp Ala 
1625 1630 1635 

agt cga ctt agt ate aac tgc aaa aga ggg gtc act gtc act acc 32666 
Ser Arg Leu Ser lie Asn Cys Lys Arg Gly Val Thr Val Thr Thr 
1640 1645 1650 

tea gga gat gca att gaa age aac ata age tgg cct aaa ggt ata 32711 
Ser Gly Asp Ala lie Glu Ser Asn lie Ser Trp Pro Lys Gly lie 
1655 1660 1665 
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aga ttt gaa ggt aat ggc ata get gca aac att ggc aga gga ttg 32756 
Arg Phe Glu Gly Asn Gly lie Ala Ala Asn lie Gly Arg Gly Leu 
1670 1675 1680 

gaa ttt gga acc act agt aca gag act gat gtc aca gat gca tac 32801 
Glu Phe Gly Thr Thr Ser Thr Glu Thr Asp Val Thr Asp Ala Tyr 
1685 1690 1695 



cca att caa gtt aaa ttg ggt act ggc ctt acc ttt gac agt aca 
Pro lie Gin Val Lys Leu Gly Thr Gly Leu Thr Phe Asp Ser Thr 
1700 1705 1710 



32846 



ggc gec att gtt get tgg aac aaa gag gat gat 
Gly Ala lie Val Ala Trp Asn Lys Glu Asp Asp 
1715 1720 ' 1725 



aaa ctt aca tta 
Lys Leu Thr Leu 



32891 



tgg acc aca gec gac ccc teg cca aat tgc aaa ata tac tct gaa 
Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys Lys He Tyr Ser Glu 
1730 1735 1740 



32936 



aaa gat gec aaa etc aca ctt tgc ttg aca aag 
Lys Asp Ala Lys Leu Thr Leu Cys Leu Thr Lys 
1745 1750 1755 



tgt gga agt caa 
Cys Gly Ser Gin 



32981 



att ctg ggt act gtg act gta ttg gca gtg aat aat gga agt etc 
He Leu Gly Thr Val Thr Val Leu Ala Val Asn Asn Gly Ser Leu 
1760 1765 1770 



33026 



aac cca ate aca aac aca gta age act gca etc gtc tec etc aag 
Asn Pro He Thr Asn Thr Val Ser Thr Ala Leu Val Ser Leu Lys 
1775 1780 1785 



33071 



ttt gat gca agt gga gtt ttg eta age age tec aca tta gac aaa 
Phe Asp Ala Ser Gly Val Leu Leu Ser Ser Ser Thr Leu Asp Lys 
1790 1795 1800 



33116 



gaa tat tgg aac ttc aga aag gga gat gtt aca 
Glu Tyr Trp Asn Phe Arg Lys Gly Asp Val Thr 
1805 1810 ~ ' 1815 



cct get gag ccc 
Pro Ala Glu Pro 



33161 



tat act aat get ata ggt ttt atg cct aac ata aag gee tat cct 
Tyr Thr Asn Ala He Gly Phe Met Pro Asn He Lys Ala Tyr Pro 
1820 1825 1830 



33206 



aaa aac aca tct gca get tea aaa age cat att gtc agt caa gtt 
Lys Asn Thr Ser Ala Ala Ser Lys Ser His He Val Ser Gin Val 
1835 1840 1845 



33251 



tat etc aat ggg gat gag gee aaa cca ctg atg ctg att att act 33296 

Tyr Leu Asn Gly Asp Glu Ala Lys Pro Leu Met Leu He He Thr 
1850 1855 1860 

ttt aat gaa act gag gat gca act tgc acc tac agt ate act ttt 33341 

Phe Asn Glu Thr Glu Asp Ala Thr Cys Thr Tyr Ser He Thr Phe 
1865 1870 1875 
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caa tgg aaa tgg gat agt act aag tac aca ggt gaa aca ctt act 
Gin Trp Lys Trp Asp Ser Thr Lys Tyr Thr Sy Ilu £ Leu Ala 33386 

1885 1890 



£ K £ S £ £ £ S E S K t9i a " ct9tat < 33435 

1895 1900 1905 

ccaccctgca tgccaaccct tcccacccca ctctgtctat ggaaaaaact ctgaagcaca 33495 
aaataaaata aagttcaagt gttttattga ttcaacagtt ttacaggatt cgagcagtta 33555 
tttttcctcc accctcccag gacatggaat acaccaccct ctccccccgc acagccttga 33615 
acatctgaat gccattggtg atggacatgc ttttggtctc cacgttccac acagtttcag 33675 
agcgagccag tctcgggtcg gtcagggaga tgaaaccctc cgggcactcc cgcatctgca 33735 
cctcacagct caacagctga ggattgtcct cggtggtcgg gatcacggtt atctggaaga 33795 
agcagaagag cggcggtggg aatcatagtc cgcgaacggg atcggccggt ggtgtcgcat 33855 
caggccccgc agcagtcgct gccgccgccg ctccgtcaag ctgctgctca gggggtccgg 33915 
gtccagggac tccctcagca tgatgcccac ggccctcagc atcagtcgtc tggtgcggcg 33975 
ggcgcagcag cgcatgcgga tctcgctcag gtcgctgcag tacgtgcaac acaggaccac 34035 
caggttgttc aacagtccat agttcaacac gctccagccg aaactcatcg cgggaaggat 34095 
gctacccacg tggccgtcgt accagatcct caggtaaatc aagtggcgct ccctccagaa 34155 
cacgctgccc acgtacatga tctccttggg catgtggcgg ttcaccacct cccggtacca 34215 
catcaccctc tggttgaaca tgcagccccg gatgatcctg cggaaccaca gggccagcac 34275 
cgccccgccc gccatgcagc gaagagaccc cgggtcccgg caatggcaat ggaggaccca 34335 
ccgctcgtac ccgtggatca tctgggagct gaacaagtct atgttggcac agcacaggca 34395 
tatgctcatg catctcttca gcactctcag ctcctcgggg gtcaaaacca tatcccaggg 34455 
cacggggaac tcttgcagga cagcgaaccc cgcagaacag ggcaatcctc gcacataact 34515 
tacattgtgc atggacaggg tatcgcaatc aggcagcacc gggtgatcct ccaccagaga 34575 
agcgcgggtc tcggtctcct cacagcgtgg taagggggcc ggccgatacg ggtgatggcg 34635 
ggacgcggct gatcgtgttc gcgaccgtgt catgatgcag ttgctttcgg acattttcgt 34695 
acttgctgta gcagaacctg gtccgggcgc tgcacaccga tcgccggcgg cggtcccggc 34755 
gcttggaaog ctcggtgttg aaattgtaaa acagccactc tctcagaccg tgcagcagat 34815 
ctagggcctc aggagtgatg aagatcccat catgcctgat agctctgatc acatcgacca 34875 
ccgtggaatg ggccagaccc agccagatga tgcaattttg ttgggtttcg gtgacggcgg 34935 
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gggagggaag aacaggaaga accatgatta acttttaatc caaacggtct cggagcactt 34995 
caaaatgaag gtcgcggaga tggcacctct cgcccccgct gtgttggtgg aaaataacag 35055 
ccaggtcaaa ggtgatacgg ttctcgagat gttccacggt ggcttccagc aaagcctcca 35115 
cgcgcacatc cagaaacaag acaatagcga aagcgggagg gttctctaat tcctcaatca 35175 
tcatgttaca ctcctgcacc atccccagat aattttcatt tttccagcct tgaatgattc 35235 
gaactagttc ctgaggtaaa tccaagccag ccatgataaa gagctcgcgc agagcgccct 35295 
ccaccggcat tcttaagcac accctcataa ttccaagata ttctgctcct ggttcacctg 35355 
cagcagattg acaagcggaa tatcaaaatc tctgccgcga tccctaagct cctccctcag 35415 
caataactgt aagtactctt tcatatcctc tccgaaattt ttagccatag gaccaccagg 35475 

aataagatta gggcaagcca cagtacagat aaaccgaagt cctccccagt gagcattgcc 35535 

aaatgcaaga ctgctataag catgctggct agacccggtg atatcttcca gataactgga 35595 

cagaaaatca cccaggcaat ttttaagaaa atcaacaaaa gaaaaatcct ccaggtgcac 35655 

gtttagagcc tcgggaacaa cgatgaagta aatgcaagcg gtgcgttcca gcatggttag 35715 

ttagctgatc tgtaaaaaac aaaaaataaa acattaaacc atgctagcct ggcgaacagg 35775 

tgggtaaatc gttctctcca gcaccaggca ggccacgggg tctccggcgc gaccctcgta 35835 

aaaattgtcg ctatgattga aaaccatcac agagagacgt tcccggtggc cggcgtgaat 35895 

gattcgacaa gatgaataca cccccggaac attggcgtcc gcgagtgaaa aaaagcgccc 35955 

gaggaagcaa taaggcacta caatgctcag tctcaagtcc agcaaagcga tgccatgcgg 36015 

atgaagcaca aaatcctcag gtgcgtacaa aatgtaatta ctcccctcct gcacaggcag 36075 

cgaagccccc gatccctcca gatacacata caaagcctca gcgtccatag cttaccgagc 36135 

agcagcacac aacaggcgca agagtcagag aaaggctgag ctctaacctg tccacccgct 36195 

ctctgctcaa tatatagccc agatctacac tgacgtaaag gccaaagtct aaaaataccc 36255 

gccaaataat cacacacgcc cagcacacgc ccagaaaccg gtgacacact caaaaaaata 36315 

cgcgcacttc ctcaaacgcc caaactgccg tcatttccgg gttcccacgc tacgtcatcg 36375 

gaattcgact ttcaaattcc gtcgaccgtt aaaaacgtca cccgccccgc ccctaacggt 36435 

cgcccgtctc tcggccaatc accttcctcc ctccccaaat tcaaacagct catttgcata 36495 

ttaacgcgca ccaaaagttt gaggtatatt attgatgatg 36535 
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<210> 10 
<211> 531 
<212> PRT 

<213> chimpanzee adenovirus serotype Pan7 
<400> 10 

Met Met Arg Arg Val Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser 
15 10 15 

Val Met Gin Gin Ala Val Ala Ala Ala Met Gin Pro Pro Leu Glu Ala 
20 25 30 

Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu Gly Arg Asn Ser 
35 40 45 

lie Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Arg Leu Tyr 
S0 55 60 

Leu Val Asp Asn Lys Ser Ala Asp He Ala Ser Leu Asn Tyr Gin Asn 

70 75 so 

Asp His Ser Asn Phe Leu Thr Thr Val Val Gin Asn Asn Asp Phe Thr 
85 90 95 

Pro Thr Glu Ala Ser Thr Gin Thr lie Asn Phe Asp Glu Arg Ser Arg 
100 105 110 

Trp Gly Gly Gin Leu Lys Thr lie Met His Thr Asn Met Pro Asn Val 
1X5 120 125 



Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg Val Met Val 



135 



Ser 



140 



Arg Lys Thr Pro Asn Gly Val Ala Val Asp Glu Asn Tyr Asp Gly Ser 

150 155 160 

Gin Asp Glu Leu Thr Tyr Glu Trp Val Glu Phe Glu Leu Pro Glu Gly 
165 170 175 

Asn Phe Ser Val Thr Met Thr lie Asp Leu Met Asn Asn Ala lie lie 
180 i 8 5 19Q 

Asp Asn Tyr Leu Ala Val Gly Arg Gin Asn Gly Val Leu Glu Ser Asp 
195 200 205 

He Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro 
^ AU 215 220 

Val Thr Glu Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His 

230 235 240 

Pro Asp lie Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Glu Ser 
245 250 255 

Arg Leu Ser Asn Leu Leu Gly lie Arg Lys Arg Gin Pro Phe Gin Glu 
260 265 270 
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Gly Phe Gin lie Leu Tyr Glu Asp Leu Glu Gly Gly Asn lie Pro Ala 
275 280 ~ 285 

Leu Leu Asp Val Glu Ala Tyr Glu Lys Ser Lys Glu Glu Ala Ala Ala 
290 295 300 

Ala Ala Thr Ala Ala Val Ala Thr Ala Ser Thr Glu Val Arg Gly Asp 
305 310 315 ^ 320 

Asn Phe Ala Ser Ala Ala Ala Val Ala Glu Ala Ala Glu Thr Glu Ser 
325 330 335 

Lys lie Val lie Gin Pro Val Glu Lys Asp Ser Lys Asp Arg Ser Tyr 
340 345 * 350 

Asn Val Leu Ala Asp Lys Lys Asn Thr Ala Tyr Arg Ser Trp Tyr Leu 
355 360 365 

Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp Thr Leu 
370 ' 375 380 

Leu Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gin Val Tyr Trp Ser 
3Q 5 390 395 400 

Leu Pro Asp Met Met Gin Asp Pro Val Thr Phe Arg Ser Thr Arg Gin 
405 410 415 

Val Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Leu Pro Val Tyr Ser 
420 425 430 

Lys Ser Phe Phe Asn Glu Gin Ala Val Tyr Ser Gin Gin Leu Arg Ala 
435 440 445 

Phe Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn Gin He 
450 455 460 

Leu Val Arg Pro Pro Ala Pro Thr He Thr Thr Val Ser Glu Asn Val 
465 470 475 480 

Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser He Arg 
485 490 " 495 

Gly Val Gin Arg Val Thr Val Thr Asp Ala Arg Arg Arg Thr Cys Pro 
500 505 " 510 

Tyr Val Tyr Lys Ala Leu Gly Val Val Ala Pro Arg Val Leu Ser Ser 
515 520 525 

Arg Thr Phe 
530 

<210> 11 

<211> 932 

<212> PRT 

<213> chimpanzee adenovirus serotype Pan7 
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<400> 11 

Met Ala Thr Pro Ser Met Leu Pro Gin Trp Ala Tyr Met His He Ala 
1 5 10 15 

Gly Gin Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gin Phe Ala 
20 25 30 

Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro 
35 40 45 

Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gin Arg Leu 
50 55 60 

Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr 
65 70 75 80 

Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met 
85 90 95 

Ala Ser Thr Tyr Phe Asp He Arg Gly Val Leu Asp Arg Gly Pro Ser 
100 105 110 

Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly 
115 120 125 

Ala Pro Asn Thr Cys Gin Trp Thr Tyr Lys Ala Gly Asp Thr Asp Thr 
130 135 140 

Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val Gin Gly He Ser He 
145 150 155 160 

Thr Lys Asp Gly lie Gin Leu Gly Thr Asp Ser Asp Gly Gin Ala He 
165 170 " 175 

Tyr Ala Asp Glu Thr Tyr Gin Pro Glu Pro Gin Val Gly Asp Ala Glu 
180 185 190 

Trp His Asp He Thr Gly Thr Asp Glu Lys Tyr Gly Gly Arg Ala Leu 
195 200 ^ ~ 205 

Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Lys Pro 
210 215 220 

Thr Asn Lys Glu Gly Gly Gin Ala Asn Val Lys Thr Glu Thr Gly Gly 
225 230 235 240 

Thr Lys Glu Tyr Asp He Asp Met Ala Phe Phe Asp Asn Arg Ser Ala 
245 250 255 

Ala Ala Ala Gly Leu Ala Pro Glu He Val Leu Tyr Thr Glu Asn Val 
260 265 270 



Asp Leu Glu Thr Pro Asp Thr His He Val Tyr Lys Ala Gly Thr Asp 
275 280 285 
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Asp Ser Ser Ser Ser lie Asn Leu Gly Gin Gin Ser Met Pro Asn Arg 
290 295 300 

Pro Asn Tyr lie Gly Phe Arg Asp Asn Phe lie Gly Leu Met Tyr Tyr 
305 310 315 " 320 

Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gin Ala Ser Gin Leu 
325 330 335 

Asn Ala Val Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin 
340 345 350 

Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp 
355 360 365 

Asn Gin Ala Val Asp Ser Tyr Asp Pro Asp Val Arg lie lie Glu Asn 
370 375 380 

His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp Ala 
385 390 395 400 

Val Gly Arg Thr Asp Thr Tyr Gin Gly He Lys Ala Asn Gly Asp Asn 
405 410 ~ 415 

Gin Thr Thr Trp Thr Lys Asp Asp Thr Val Asn Asp Ala Asn Glu Leu 
420 425 430 

Gly Lys Gly Asn Pro Phe Ala Met Glu He Asn He Gin Ala Asn Leu 
435 440 445 

Trp Arg Asn Phe Leu Tyr Ala Asn Val Ala Leu Tyr Leu Pro Asp Ser 
450 455 460 

Tyr Lys Tyr Thr Pro Ala Asn He Thr Leu Pro Thr Asn Thr Asn Thr 
465 470 475 480 

Tyr Asp Tyr Met Asn Gly Arg Val Val Ala Pro Ser Leu Val Asp Ala 
485 490 495 

Tyr He Asn He Gly Ala Arg Trp Ser Leu Asp Pro Met Asp Asn Val 
500 505 510 . 

Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met 
515 520 525 

Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His He Gin Val Pro Gin 
530 535 540 

Lys Phe Phe Ala He Lys Ser Leu Leu Leu Leu Pro Gly Ser Tyr Thr 
545 550 555 560 

Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met He Leu Gin Ser Ser 
565 570 575 

Leu Gly Asn Asp Leu Arg Thr Asp Gly Ala Ser He Ala Phe Thr Ser 
580 585 590 
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lie Asn Leu Tyr Ala Thr Phe Phe Pro Met Ala His Asn Thr Ala Ser 
595 600 605 

Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gin Ser Phe Asn 
610 615 620 

Asp Tyr Leu Ser Ala Ala Asn Met Leu Tyr Pro lie Pro Ala Asn Ala 
625 630 ~ 635 640 

Thr Asn Val Pro lie Ser lie Pro Ser Arg Asn Trp Ala Ala Phe Arg 
645 650 * 655 

Gly Trp Ser Phe Thr Arg Leu Lys Thr Arg Glu Thr Pro Ser Leu Gly 
660 665 670 

Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly Ser lie Pro Tyr Leu 
675 680 " 685 

Asp Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser He Thr 
690 695 700 

Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro 
705 710 715 720 

Asn Glu Phe Glu He Lys Arg Thr Val Asp Gly Glu Gly Tyr Asn Val 
725 730 735 

Ala Gin Cys Asn Met Thr Lys Asp Trp Phe Leu Val Gin Met Leu Ala 
740 745 750 

His Tyr Asn He Gly Tyr Gin Gly Phe Tyr Val Pro Glu Gly Tyr Lys 
755 760 765 

Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gin Pro Met Ser Arg Gin 
770 775 780 

Val Val Asp Glu Val Asn Tyr Lys Asp Tyr Gin Ala Val Thr Leu Ala 
785 790 " 795 800 

Tyr Gin His Asn Asn Ser Gly Phe Val Gly Tyr Leu Ala Pro Thr Met 
805 810 "* 815 

Arg Gin Gly Gin Pro Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu He Gly 
820 825 830 

Lys Ser Ala Val Ala Ser Val Thr Gin Lys Lys Phe Leu Cys Asp Arg 
835 840 845 

Val Met Trp Arg He Pro Phe Ser Ser Asn Phe Met Ser Met Gly Ala 
850 855 860 

Leu Thr Asp Leu Gly Gin Asn Met Leu Tyr Ala Asn Ser Ala His Ala 
865 870 875 880 

Leu Asp Met Asn Phe Glu Val Asp Pro Met Asp Glu Ser Thr Leu Leu 
885 890 ~ 895 
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Tyr Val Val Phe Glu Val Phe Asp Val Val Arg Val His Gin Pro His 
900 905 910 

Arg Gly Val lie Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly 
915 920 925 

Asn Ala Thr Thr 
930 



<210> 12 

<211> 443 

<212> PRT 

<213> chimpanzee adenovirus serotype Pan7 

<400> 12 

Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp Phe Asp Pro Val Tyr 
1 5 10 15 

Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val Pro Phe He Asn Pro Pro 
20 25 30 

Phe Val Ser Ser Asp Gly Phe Gin Glu Lys Pro Leu Gly Val Leu Ser 
35 40 45 

Leu Arg Leu Ala Asp Pro Val Thr Thr Lys Asn Gly Glu He Thr Leu 
50 55 60 

Lys Leu Gly Glu Gly Val Asp Leu Asp Ser Ser Gly Lys Leu He Ser 
65 70 75 ' 80 

Asn Thr Ala Thr Lys Ala Ala Ala Pro Leu Ser Phe Ser Asn Asn Thr 
85 90 95 

He Ser Leu Asn Met Asp Thr Pro Leu Tyr Thr Lys Asp Gly Lys Leu 
100 105 ~ ~ 110 

Ser Leu Gin Val Ser Pro Pro Leu Asn He Leu Lys Ser Thr He Leu 
115 120 125 

Asn Thr Leu Ala Val Ala Tyr Gly Ser Gly Leu Gly Leu Ser Gly Gly 
130 135 140 

Thr Ala Leu Ala Val Gin Leu Ala Ser Pro Leu Thr Phe Asp Glu Lys 
145 150 155 160 

Gly Asn He Lys He Asn Leu Ala Ser Gly Pro Leu Thr Val Asp Ala 
165 170 175 

Ser Arg Leu Ser He Asn Cys Lys Arg Gly Val Thr Val Thr Thr Ser 
180 " 185 190 

Gly Asp Ala He Glu Ser Asn He Ser Trp Pro Lys Gly He Arg Phe 
195 200 205 



102 



WO 03/046124 



PCTYUS02/33645 



Glu Gly Asn Gly He Ala Ala Asn He Gly Arg Gly Leu Glu Phe Gly 
210 215 220 

Thr Thr Ser Thr Glu Thr Asp Val Thr Asp Ala Tyr Pro He Gin Val 
225 230 235 240 

Lys Leu Gly Thr Gly Leu Thr Phe Asp Ser Thr Gly Ala He Val Ala 
245 250 255 

Trp Asn Lys Glu Asp Asp Lys Leu Thr Leu Trp Thr Thr Ala Asp Pro 
260 265 270 

Ser Pro Asn Cys Lys He Tyr Ser Glu Lys Asp Ala Lys Leu Thr Leu 
275 280 285 

Cys Leu Thr Lys Cys Gly Ser Gin He Leu Gly Thr Val Thr Val Leu 
290 295 300 

Ala Val Asn Asn Gly Ser Leu Asn Pro He Thr Asn Thr Val Ser Thr 
305 310 315 320 

Ala Leu Val Ser Leu Lys Phe Asp Ala Ser Gly Val Leu Leu Ser Ser 
325 330 " 335 

Ser Thr Leu Asp Lys Glu Tyr Trp Asn Phe Arg Lys Gly Asp Val Thr 
340 345 350 

Pro Ala Glu Pro Tyr Thr Asn Ala He Gly Phe Met Pro Asn He Lys 
355 360 365 

Ala Tyr Pro Lys Asn Thr Ser Ala Ala Ser Lys Ser His He Val Ser 
370 375 380 

Gin Val Tyr Leu Asn Gly Asp Glu Ala Lys Pro Leu Met Leu He He 
385 390 395 400 

Thr Phe Asn Glu Thr Glu Asp Ala Thr Cys Thr Tyr Ser He Thr Phe 
405 410 415 

Gin Trp Lys Trp Asp Ser Thr Lys Tyr Thr Gly Glu Thr Leu Ala Thr 
420 425 ~ 430 



Ser Ser Phe Thr Phe Ser Tyr He Ala Gin Glu 
435 440 



<210> 13 

<211> 338 

<212> PRT 

<213> simian serotype CI 

<400> 13 

Ala Pro Lys Gly Ala Pro Asn Thr Ser Gin Trp Leu Asp Lys Gly Val 
1 5 10 15 
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Thr Thr Thr Asp Asn Asn Thr Glu Asn Gly Asp Glu Glu Asp Glu Val 
20 25 30 

Ala Glu Glu Gly Glu Glu Glu Lys Gin Ala Thr Tyr Thr Phe Gly Asn 
35 40 45 

Ala Pro Val Lys Ala Glu Ala Glu lie Thr Lys Glu Gly Leu Pro lie 
50 55 60 

Gly Leu Glu Val Pro Ser Glu Gly Asp Pro Lys Pro He Tyr Ala Asp 
65 70 75 80 

Lys Leu Tyr Gin Pro Glu Pro Gin Val Gly Glu Glu Ser Trp Thr Asp 
85 90 95 

Thr Asp Gly Thr Asp Glu Lys Tyr Gly Gly Arg Ala Leu Lys Pro Glu 
100 105 110 

Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Lys Pro Thr Asn Val 
115 " 120 125 

Lys Gly Gly Gin Ala Lys Val Lys Lys Val Glu Glu Gly Lys Val Glu 
130 135 140 

Tyr Asp He Asp Met Asn Phe Phe Asp Leu Arg Ser Gin Lys Thr Gly 
145 150 155 ' 160 

Leu Lys Pro Lys He Val Met Tyr Ala Glu Asn Val Asp Leu Glu Thr 
165 ~ 170 ~ 175 

Pro Asp Thr His Val Val Tyr Lys Pro Gly Ala Ser Asp Ala Ser Ser 
180 185 190 

His Ala Asn Leu Gly Gin Gin Ser Met Pro Asn Arg Pro Asn Tyr He 
195 200 205 

Gly Phe Arg Asp Asn Phe He Gly Leu Met Tyr Tyr Asn Ser Thr Gly 
210 215 ~ 220 

Asn Met Gly Val Leu Ala Gly Gin Ala Ser Gin Leu Asn Ala Val Val 
225 230 235 240 

Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin Leu Leu Leu Asp 
245 250 255 

Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gin Ala Val 
260 " " 265 270 

Asp Ser Tyr Asp Pro Asp Val Arg Val He Glu Asn His Gly Val Glu 
275 " 280 285 

Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp Gly Val Gly Pro Arg 
290 295 300 



Thr Asp Ser Tyr Lys Gly He Glu Thr Asn Gly Asp Glu Asn Thr Thr 
305 310 315 320 



104 



WO 03/046124 



PCT/US02/33645 



Trp Lys Asp Leu Asp Pro Asn Gly lie Ser Glu Leu Ala Lys Gly Asn 
325 330 335 

Pro Phe 



<210> 14 

<211> 315 

<212> PRT 

<213> chimpanzee adenovirus Pan-9 

<400> 14 

Ala Pro Lys Gly Ala Pro Asn Thr Cys Gin Trp Thr Tyr Lys Ala Asp 
1 5 10 15 

Gly Glu Thr Ala Thr Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val 
20 25 30 

Gin Gly lie Asn lie Thr Lys Asp Gly lie Gin Leu Gly Thr Asp Thr 
35 40 45 

Asp Asp Gin Pro lie Tyr Ala Asp Lys Thr Tyr Gin Pro Glu Pro Gin 
50 55 60 

Val Gly Asp Ala Glu Trp His Asp lie Thr Gly Thr Asp Glu Lys Tyr 
65 70 75 80 

Gly Gly Arg Ala Leu Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly 
85 90 " " 95 

Ser Phe Ala Lys Pro Thr Asn Lys Glu Gly Gly Gin Ala Asn Val Lys 
100 105 110 

Thr Gly Thr Gly Thr Thr Lys Glu Tyr Asp He Asp Met Ala Phe Phe 
115 120 125 

Asp Asn Arg Ser Ala Ala Ala Ala Gly Leu Ala Pro Glu He Val Leu 
130 135 ~ 140 

Tyr Thr Glu Asn Val Asp Leu Glu Thr Pro Asp Thr His He Val Tyr 
145 150 155 160 

Lys Ala Gly Thr Asp Asp Ser Ser Ser Ser He Asn Leu Gly Gin Gin 
165 170 ~ 175 

Ala Met Pro Asn Arg Pro Asn Tyr He Gly Phe Arg Asp Asn Phe He 
180 185 " 190 

Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly 
195 200 205 

Gin Ala Ser Gin Leu Asn Ala Val Val Asp Leu Gin Asp Arg Asn Thr 
210 215 220 
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Glu Leu Ser Tyr Gin Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg 
225 230 235 240 



Tyr Phe Ser Met Trp Asn Gin Ala 
245 

Arg lie lie Glu Asn His Gly Val 
260 

Phe Pro Leu Asp Ala Val Gly Arg 
275 280 

Ala Asn Gly Thr Asp Gin Thr Thr 
290 " * 295 



Val Asp Ser Tyr Asp Pro Asp Val 
250 255 



Glu Asp Glu Leu Pro Asn Tyr Cys 
265 270 



Thr Asp Thr Tyr Gin Gly lie Lys 
285 



Trp Thr Lys Asp Asp Ser Val Asn 
300 



Asp Ala Asn Glu lie Gly Lys Gly Asn Pro Phe 
305 310 315 



<210> 15 

<211> 315 

<212> PRT 

<213> chimpanzee adenovirus Pan-5 

<400> 15 

Ala Pro Lys Gly Ala Pro Asn Thr Cys Gin Trp Thr Tyr Lys Ala Asp 
1 5 10 15 

Gly Asp Thr Gly Thr Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val 
20 25 ~ 30 

Gin Gly lie Ser lie Thr Lys Asp Gly lie Gin Leu Gly Thr Asp Thr 
35 40 45 

Asp Asp Gin Pro lie Tyr Ala Asp Lys Thr Tyr Gin Pro Glu Pro Gin 
50 55 60 

Val Gly Asp Ala Glu Trp His Asp lie Thr Gly Thr Asp Glu Lys Tyr 
65 70 75 "* 80 

Gly Gly Arg Ala Leu Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly 
85 90 95 

Ser Phe Ala Lys Pro Thr Asn Lys Glu Gly Gly Gin Ala Asn Val Lys 
100 105 110 

Thr Glu Thr Gly Gly Thr Lys Glu Tyr Asp lie Asp Met Ala Phe Phe 
115 120 125 

Asp Asn Arg Ser Ala Ala Ala Ala Gly Leu Ala Pro Glu lie Val Leu 
130 135 140 

Tyr Thr Glu Asn Val Asp Leu Glu Thr Pro Asp Thr His lie Val Tyr 
145 150 155 160 
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Lys Ala Gly Thr Asp Asp Ser Ser Ser Ser lie Asn Leu Gly Gin Gin 
165 170 175 

Ser Met Pro Asn Arg Pro Asn Tyr lie Gly Phe Arg Asp Asn Phe lie 
180 185 ^ 190 

Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly 
195 200 205 

Gin Ala Ser Gin Leu Asn Ala Val Val Asp Leu Gin Asp Arg Asn Thr 
210 215 220 

Glu Leu Ser Tyr Gin Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg 
225 230 235 * 240 

Tyr Phe Ser Met Trp Asn Gin Ala Val Asp Ser Tyr Asp Pro Asp Val 
245 250 255 

Arg lie lie Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys 
260 265 270 

Phe Pro Leu Asp Ala Val Gly Arg Thr Asp Thr Tyr Gin Gly lie Lys 
275 280 285 

Ala Asn Gly Ala Asp Gin Thr Thr Trp Thr Lys Asp Asp Thr Val Asn 
290 295 ~ 300 

Asp Ala Asn Glu Leu Gly Lys Gly Asn Pro Phe 
305 310 " 315 



<210> 16 
<211> 324 
<212> PRT 

<213> chimpanzee adenovirus Pan-6 
<400> 16 

Ala Pro Lys Gly Ala Pro Asn Ser Ser Gin Trp Glu Gin Ala Lys Thr 
15 10 15 

Gly Asn Gly Gly Thr Met Glu Thr His Thr Tyr Gly Val Ala Pro Met 
20 25 30 

Gly Gly Glu Asn lie Thr Lys Asp Gly Leu Gin lie Gly Thr Asp Val 
35 40 45 

Thr Ala Asn Gin Asn Lys Pro lie Tyr Ala Asp Lys Thr Phe Gin Pro 
50 55 60 

Glu Pro Gin Val Gly Glu Glu Asn Trp Gin Glu Thr Glu Asn Phe Tyr 
65 70 75 80 

Gly Gly Arg Ala Leu Lys Lys Asp Thr Lys Met Lys Pro Cys Tyr Gly 
85 90 95 
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Ser Tyr Ala Arg Pro Thr Asn Glu Lys Gly Gly Gin Ala Lys Leu Lys 
100 105 110 

Val Gly Asp Asp Gly Val Pro Thr Lys Glu Phe Asp lie Asp Leu Ala 
115 120 125 

Phe Phe Asp Thr Pro Gly Gly Thr Val Asn Gly Gin Asp Glu Tyr Lys 
130 135 140 

Ala Asp lie Val Met Tyr Thr Glu Asn Thr Tyr Leu Glu Thr Pro Asp 
145 150 155 160 

Thr His Val Val Tyr Lys Pro Gly Lys Asp Asp Ala Ser Ser Glu lie 
165 170 175 

Asn Leu Val Gin Gin Ser Met Pro Asn Arg Pro Asn Tyr lie Gly Phe 
180 185 190 

Arg Asp Asn Phe lie Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met 
195 200 205 

Gly Val Leu Ala Gly Gin Ala Ser Gin Leu Asn Ala Val Val Asp Leu 
210 215 220 

Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin Leu Leu Leu Asp Ser Leu 
225 230 235 240 

Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gin Ala Val Asp Ser 
245 250 255 

Tyr Asp Pro Asp Val Arg lie lie Glu Asn His Gly Val Glu Asp Glu 
260 265 270 

Leu Pro Asn Tyr Cys Phe Pro Leu Asp Gly Ser Gly Thr Asn Ala Ala 
275 280 " 285 

Tyr Gin Gly Val Lys Val Lys Asp Gly Gin Asp Gly Asp Val Glu Ser 
290 295 300 

Glu Trp Glu Asn Asp Asp Thr Val Ala Ala Arg Asn Gin Leu Cys Lys 
305 310 ' 315 320 



Gly Asn lie Phe 



<210> 17 
<211> 314 
<212> PRT 

<213> chimpanzee adenovirus Pan-7 
<400> 17 

Ala Pro Lys Gly Ala Pro Asn Thr Cys Gin Trp Thr Tyr Lys Ala Gly 
15 10 15 

Asp Thr Asp Thr Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val Gin 
20 25 ^ " 30 
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Gly lie Ser lie Thr Lys Asp Gly 
35 ** 40 

Gly Gin Ala lie Tyr Ala Asp Glu 
50 55 



lie Gin Leu Gly 



Thr Tyr Gin Pro 
60 



Thr Asp Ser Asp 
45 

Glu Pro Gin Val 



Gly Asp Ala Glu 
65 

Gly Arg Ala Leu 



Phe Ala Lys Pro 
100 



Trp His 
70 



Asp lie 
Asp Thr 
Thr Asn Lys Glu 



Lys Pro 
85 



Thr Gly Thr Asp 
75 

Lys Met Lys Pro 
90 

Gly Gly Gin Ala 
105 



Glu Lys Tyr Gly 
80 

Cys Tyr Gly Ser 
95 

Asn Val Lys Thr 
110 



Glu Thr Gly Gly Thr Lys Glu Tyr 
115 120 



Asp lie Asp Met 



Ala Phe Phe Asp 
125 



Asn Arg Ser Ala Ala Ala Ala Gly 
130 135 



Leu Ala Pro Glu 
140 



He Val Leu Tyr 



Thr Glu Asn Val Asp Leu 
145 150 



Glu Thr Pro Asp Thr His 
155 



He Val Tyr Lys 
160 



Ala Gly Thr Asp 



Asp Ser Ser Ser Ser He Asn Leu 
165 170 



Gly Gin Gin Ser 
175 



Met Pro Asn Arg 
180 



Pro Asn Tyr He 



Gly Phe Arg Asp 
185 



Asn Phe He Gly 
190 



Leu Met Tyr Tyr Asn Ser Thr Gly 
195 * 200 



Asn Met Gly Val 



Leu Ala Gly Gin 
205 



Ala Ser Gin Leu Asn Ala Val Val 
210 215 



Asp Leu Gin Asp 
220 



Arg Asn Thr Glu 



Leu Ser Tyr Gin Leu Leu 
225 230 



Leu Asp Ser Leu Gly Asp 
235 



Arg Thr Arg Tyr 
240 



Phe Ser Met Trp 



Asn Gin Ala Val Asp Ser Tyr Asp 
245 250 



Pro Asp Val Arg 
255 



He He Glu Asn His Gly 
260 

Pro Leu Asp Ala Val Gly 
275 

Asn Gly Asp Asn Gin Thr 
290 



Val Glu Asp Glu Leu Pro 
265 



Arg Thr 
280 

Thr Trp 
295 



Asp Thr Tyr Gin 



Thr Lys Asp Asp 
300 



Asn Tyr Cys Phe 
270 

Gly He Lys Ala 
285 

Thr Val Asn Asp 



Ala Asn Glu Leu Gly Lys 
305 ^ 310 



Gly Asn Pro Phe 
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<210> 18 

<211> 179 

<212> PRT 

<213> chimpanzee adenovirus Pan9 

<400> 18 

Thr Leu Trp Thr Thr Pro Asp Pro Ser Pro Asn Cys Gin lie Leu Ala' 
1 5 10 15 

Glu Asn Asp Ala Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly Ser Gin 
20 25 " 30 

lie Leu Ala Thr Val Ser Val Leu Val Val Gly Ser Gly Asn Leu Asn 
35 40 45 

Pro He Thr Gly Thr Val Ser Ser Ala Gin Val Phe Leu Arg Phe Asp 
50 55 60 

Ala Asn Gly Val Leu Leu Thr Glu His Ser Thr Leu Lys Lys Tyr Trp 
65 70 75 " 80 

Gly Tyr Arg Gin Gly Asp Ser He Asp Gly Thr Pro Tyr Thr Asn Ala 
85 90 95 

Val Gly Phe Met Pro Asn Leu Lys Ala Tyr Pro Lys Ser Gin Ser Ser 
100 105 ~ 110 

Thr Thr Lys Asn Asn He Val Gly Gin Val Tyr Met Asn Gly Asp Val 
115 120 125 

Ser Lys Pro Met Leu Leu Thr He Thr Leu Asn Gly Thr Asp Asp Ser 
130 135 140 

Asn Ser Thr Tyr Ser Met Ser Phe Ser Tyr Thr Trp Thr Asn Gly Ser 
145 150 155 " 160 

Tyr Val Gly Ala Thr Phe Gly Ala Asn Ser Tyr Thr Phe Ser Tyr He 
165 170 " 175 

Ala Gin Glu 



<210> 19 

<211> 185 

<212> PRT 

<213> chimpanzee adenovirus Pan6 

<400> 19 

Thr Leu Trp Thr Thr Pro Asp Pro Ser Pro Asn Cys Gin Leu Leu Ser 
1 5 10 15 

Asp Arg Asp Ala Lys Phe Thr Leu Cys Leu Thr Lys Cys Gly Ser Gin 
20 25 30 
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lie Leu Gly Thr Val Ala Val Ala Ala Val Thr Val Gly Ser Ala Leu 
35 40 45 

Asn Pro lie Asn Asp Thr Val Lys Ser Ala lie Val Phe Leu Arg Phe 
50 55 60 

Asp Ser Asp Gly Val Leu Met Ser Asn Ser Ser Met Val Gly Asp Tyr 
65 70 75 ' 80 

Trp Asn Phe Arg Glu Gly Gin Thr Thr Gin Ser Val Ala Tyr Thr Asn 
85 90 95 

Ala Val Gly Phe Met Pro Asn He Gly Ala Tyr Pro Lys Thr Gin Ser 
100 105 ~ HO 

Lys Thr Pro Lys Asn Ser He Val Ser Gin Val Tyr Leu Thr Gly Glu 
115 120 125 

Thr Thr Met Pro Met Thr Leu Thr He Thr Phe Asn Gly Thr Asp Glu 
130 135 140 

Lys Asp Thr Thr Pro Val Ser Thr Tyr Ser Met Thr Phe Thr Trp Gin 
145 150 155 160 

Trp Thr Gly Asp Tyr Lys Asp Lys Asn He Thr Phe Ala Thr Asn Ser 
165 170 175 

Phe Ser Phe Ser Tyr He Ala Gin Glu 
180 185 



<210> 20 

<211> 179 

<212> PRT 

<213> chimpanzee adenovirus Pan7 

<400> 20 

Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys Lys He Tyr Ser 
1 5 10 15 

Glu Lys Asp Ala Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly Ser Gin 
20 25 30 

He Leu Gly Thr Val Thr Val Leu Ala Val Asn Asn Gly Ser Leu Asn 
35 40 45 

Pro He Thr Asn Thr Val Ser Thr Ala Leu Val Ser Leu Lys Phe Asp 
50 55 60 

Ala Ser Gly Val Leu Leu Ser Ser Ser Thr Leu Asp Lys Glu Tyr Trp 
65 70 75 80 

Asn Phe Arg Lys Gly Asp Val Thr Pro Ala Glu Pro Tyr Thr Asn Ala 
85 90 " 95 
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lie Gly Phe Met Pro Asn He Lys 
100 



Ala Ser Lys Ser His He Val Ser 
115 120 

Ala Lys Pro Leu Met Leu He He 
130 135 

Thr Cys Thr Tyr Ser He Thr Phe 
145 150 

Tyr Thr Gly Glu Thr Leu Ala Thr 
165 

Ala Gin Glu 



Ala Tyr Pro Lys Asn Thr Ser Ala 
105 no 

Gin Val Tyr Leu Asn Gly Asp Glu 
125 



Thr Phe Asn Glu Thr Glu Asp Ala 
140 



Gin Trp Lys Trp Asp Ser Thr Lys 

155 160 

Ser Ser Phe Thr Phe Ser Tyr He 

170 175 



<210> 21 
<211> 179 
<212> PRT 

<213> chimpanzee adenovirus Pan5 
<400> 21 

Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys His He Tyr Ser 
15 10 15 

Glu Lys Asp Ala Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly Ser Gin 
20 25 * 30 

He Leu Gly Thr Val Ser Leu He Ala Val Asp Thr Gly Ser Leu Asn 
35 40 45 

Pro He Thr Gly Thr Val Thr Thr Ala Leu Val Ser Leu Lys Phe Asp 
50 55 60 

Ala Asn Gly Val Leu Gin Ser Ser Ser Thr Leu Asp Ser Asp Tyr Trp 
65 70 75 80 

Asn Phe Arg Gin Gly Asp Val Thr Pro Ala Glu Ala Tyr Thr Asn Ala 
85 90 95 

He Gly Phe Met Pro Asn Leu Lys Ala Tyr Pro Lys Asn Thr Ser Glv 
100 105 no 

Ala Ala Lys Ser His He Val Gly Lys Val Tyr Leu His Gly Asp Thr 
H5 120 ~ 125 

Gly Lys Pro Leu Asp Leu He He Thr Phe Asn Glu Thr Ser Asp Glu 
130 135 140 

Ser Cys Thr Tyr Cys He Asn Phe Gin Trp Gin Trp Gly Ala Asp Gin 
145 150 155 160 

Tyr Lys Asn Glu Thr Leu Ala Val Ser Ser Phe Thr Phe Ser Tyr He 
165 170 175 
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Ala Lys Glu 



<210> 22 

<211> 183 

<212> PRT 

<213> human adenovirus Ad 2 

<400> 22 

Thr Leu Trp Thr Thr Pro Asp Pro Ser Pro Asn Cys Arg He His Ser 
1 5 10 15 

Asp Asn Asp Cys Lys Phe Thr Leu Val Leu Thr Lys Cys Gly Ser Gin 
20 25 " 30 

Val Leu Ala Thr Val Ala Ala Leu Ala Val Ser Gly Asp Leu Ser Ser 
35 40 45 

Met Thr Gly Thr Val Ala Ser Val Ser He Phe Leu Arg Phe Asp Gin 
50 55 60 

Asn Gly Val Leu Met Glu Asn Ser Ser Leu Lys Lys His Tyr Trp Asn 
65 70 75 ^ ^ 80 

Phe Arg Asn Gly Asn Ser Thr Asn Ala Asn Pro Tyr Thr Asn Ala Val 
85 90 ~ 95 

Gly Phe Met Pro Asn Leu Leu Ala Tyr Pro Lys Thr Gin Ser Gin Thr 
100 105 HO 

Ala Lys Asn Asn He Val Ser Gin Val Tyr Leu His Gly Asp Lys Thr 
115 120 125 

Lys Pro Met He Leu Thr He Thr Leu Asn Gly Thr Ser Glu Ser Thr 
130 135 140 

Glu Thr Ser Glu Val Ser Thr Tyr Ser Met Ser Phe Thr Trp Ser Trp 
145 150 155 160 

Glu Ser Gly Lys Tyr Thr Thr Glu Thr Phe Ala Thr Asn Ser Tyr Thr 
165 170 175 

Phe Ser Tyr He Ala Gin Glu 
180 

<210> 23 
<211> 182 
<212> PRT 

<213> human adenovirus Ad 5 
<400> 23 

Thr Leu Trp Thr Thr Pro Ala Pro Ser Pro Asn Cys Arg Leu Asn Ala 
15 10 15 
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Glu Lys Asp Ala Lys Leu Thr Leu Val Leu Thr Lys Cys Gly Ser Gin 
20 25 30 

lie Leu Ala Thr Val Ser Val Leu Ala Val Lys Gly Ser Leu Ala Pro 
35 40 45 

He Ser Gly Thr Val Gin Ser Ala His Leu He He Arg Phe Asp Glu 
50 55 60 

Asn Gly Val Leu He Asn Asn Ser Phe Leu Asp Pro Glu Tyr Trp Asn 
65 70 75 80 

Phe Arg Asn Gly Asp Leu Thr Glu Gly Thr Ala Tyr Thr Asn Ala Val 
85 90 95 

Gly Phe Met Pro Asn Leu Ser Ala Tyr Pro' Lys Ser His Gly Lys Thr 
100 105 110 

Ala Lys Ser Asn He Val Ser Gin Val Tyr Leu Asn Gly Asp Lys Thr 
115 120 125 

Lys Pro Val Thr Leu Thr He Thr Leu Asn Gly Thr Gin Glu Thr Gly 
130 135 140 

Asp Thr Thr Pro Ser Ala Tyr Ser Met Ser Phe Ser Trp Asp Trp Ser 
145 150 155 " 160 

Gly His Asn Tyr He Asn Glu He Phe Ala Thr Ser Ser Tyr Thr Glu 
165 170 175 

Ser Tyr He Ala Gin Glu 
180 



<210> 24 

<211> 34264 

<212> DNA 

<213> simian adenovirus SV-1 

<220> 

<221> CDS 

<222> (12454) . . (13965) 

<223> L2 Penton 

<220> 

<221> CDS 

<222> (16841) . . (19636) 

<223> L3 Hexon 

<220> 

<221> CDS 

<222> (28059) . . (29150) 

<223> L5 Fiber #2 
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<220> 

<221> CDS 

<222> (29183) .. (30865) 
<223> L5 Fiber #1 

<400> 24 

tccttattct ggaaacgtgc caatatgata atgagcgggg aggagcgagg cggggccggg 60 

gtgacgtgcg gtgacgtggg gtgacgcggg gtggcgcgag ggcggggcgg gagtggggag 120 

gcgcttagtt tttacgtatg cggaaggagg ttttataccg gaagttgggt aatttgggcg 180 

tatacttgta agttttgtgt aatttggcgc gaaaaccggg taatgaggaa gttgaggtta 240 

atatgtactt tttatgactg ggcggaattt ctgctgatca gcagtgaact ttgggcgctg 300 

acggggaggt ttcgctacgt ggcagtacca cgagaaggct caaaggtccc atttattgta 360 

ctcctcagcg ttttcgctgg gtatttaaac gctgtcagat catcaagagg ccactcttga 420 

gtgccggcga gtagagtttt ctcctccgcg ctgccgcgat gaggctggtt cccgagatgt 480 

acggtgtttt ctgcagcgag acggcccgga actcagatga gctgcttaat acagatctgc 540 

tggatgttcc caactcgcct gtggcttcgc ctccgtcgct tcatgatctt ttcgatgtgg 600 

aagtggatcc accgcaagat cccaacgagg acgcggtaaa cagtatgttc cctgaatgtc 660 

tgtttgaggc ggctgaggag ggttctcaca gcagtgaaga gagcagacgg ggagaggaac 720 

tggacttgaa atgctacgag gaatgtctgc cttctagcga ttctgaaacg gaacagacag 780 

ggggagacgg ctgtgagtcg gcaatgaaaa atgaacttgt attagactgt ccagaacatc 840 

ctggtcatgg ctgccgtgcc tgtgcttttc atagaaatgc cagcggaaat cctgagactc 900 

tatgtgctct gtgttatctg cgccttacca gcgattttgt atacagtaag taaagtgttt 960 

tcattggcgt acggtagggg attcgttgaa gtgctttgtg acttattatg tgtcattatt 1020 

tctaggtgac gtgtccgacg tggaagggga aggagataga tcaggggctg ctaattctcc 1080 

ttgcactttg ggggctgtgg ttccagttgg catttttaaa ccgagtggtg gaggagaacg 1140 

agccggagga gaccgagaat ctgagagccg gcctggaccc tccagtggaa gactaggtgc 1200 

tgaggatgat cctgaagagg ggactagtgg gggtgctagg aaaaagcaaa aaactgagcc 1260 

tgaacctaga aactttttga atgagttgac tgtaagccta atgaatcggc agcgtcctga 1320 

gacggtgttt tggactgagt tggaggatga gttcaagaag ggggaattaa acctcttgta 1380 

caagtatggg tttgagcagt tgaaaactca ctggttggag ccgtgggagg atatggaaat 1440 

ggctctagac acctttgcta aagtggctct gcggccggat aaagtttaca ctattcgccg 1500 
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cagctcgcag gcttgacgaa cgagggtctc ctcgcccgag acgtgcatga ccagcatgaa 6720 
gggtaccaac tgtttcccga acgagcccat ccatgtgtag gtttctacgt cgtaggtgac 6780 
aaagagccgc tgggtgcgcg cgtgggagcc gatcgggaag aagctgatct cctgccacca 6840 
gttggaggaa tgggtg.ttga tgtggtgaaa gtagaagtcc cgccggcgca cagagcattc 6900 
gtgctgatgt ttgtaaaagc gaccgcagta gtcgcagcgc tgcacgctct gtatctcctg 6960 

aatgagatgc gcttttcgcc cgcgcaccag aaaccggagg gggaagttga gacgggggct 7020 

tggtggggcg gcatcccctt cgccttggcg gtgggagtct gcgtctgcgc cctccttctc 7080 

tgggtggacg acggtgggga cgacgacgcc ccgggtgccg caagtccaga tctccgccac 7140 

ggaggggcgc aggcgttgca ggaggggacg cagctgcccg ctgtccaggg agtcgagggc 7200 

ggccgcgctg aggtcggcgg gaagcgtttg caagttcact ttcagaagac cggtaagagc 7260 

gtgagccagg tgcacatggt acttgatttc caggggggtg ttggaagagg cgtccacggc 7320 

gtagaggagg ccgtgtccgc gcggggccac caccgtgccc cgaggaggtt ttatctcact 7380 

cgtcgagggc gagcgccggg gggtagaggc ggctctgcgc cggggggcag cggaggcagt 7440 

ggcacgtttt cgtgaggatt cggcagcggt tgatgacgag cccggagact gctggcgtgg 7500 

gcgacgacgc ggcggttgag gtcctggatg tgccgtctct gcgtgaagac caccggcccc 7560 

cgggtcctga acctgaaaga gagttccaca gaatcaatgt ctgcatcgtt aacggcggcc 7620 

tgcctgagga tctcctgtac gtcgcccgag ttgtcttgat aggcgatctc ggccatgaac 7680 

tgctccactt cttcctcgcg gaggtcgccg tggcccgctc gctccacggt ggcggccagg 7740 

tcgttggaga tgcgacgcat gagttgagag aaggcgttga ggccgttctc gttccacacg 7800 

cggctgtaca ccacgtttcc gaaggagtcg cgcgctcgca tgaccacctg ggccacgttg 7860 

agttccacgt ggcgggcgaa gacggcgtag tttctgaggc gctggaagag gtagttgagc 7920 

gtggtggcga tgtgctcgca gacgaagaag tacatgatcc agcgccgcag ggtcatctcg 7980 

ttgatgtctc cgatggcttc gagacgctcc atggcctcgt agaagtcgac ggcgaagttg 8040 

aaaaattggg agttgcgggc ggccaccgtg agttcttctt gcaggaggcg gatgagatcg 8100 

gcgaccgtgt cgcgcacctc ctgctcgaaa gcgccccgag gcgcctctgc ttcttcctcc 8160 

ggctcctcct cttccagggg cacgggttcc tccggcagct ctgcgacggg gacggggcgg 8220 

cgacgtcgtc gtctgaccgg caggcggtcc acgaagcgct cgatcatttc gccgcgccgg 8280 

cgacgcatgg tctcggtgac ggcgcgtccg ttttcgcgag gtcgcagttc gaagacgccg 8340 
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ccgcgcagag cgcccccgtg cagggagggt aagtggttag ggccgtcggg cagggacacg 8400 

gcgctgacga tgcattttat caattgctgc gtaggcactc cgtgcaggga tctgagaacg 8460 

tcgaggtcga cgggatccga gaacttctct aggaaagcgt ctatccaatc gcagtcgcaa 8520 

ggtaagctga ggacggtggg ccgctggggg gcgtccgcgg gcagttggga ggtgatgctg 8580 

ctgatgatgt aattaaagta ggcggtcttc aggcggcgga tggtggcgag gaggaccacg 8640 

tctttgggcc cggcctgttg aatgcgcagg cgctcggcca tgccccaggc ctcgctctga 8700 

cagcgacgca ggtctttgta gtagtcttgc atcagtctct ccaccggaac ctctgcttct 8760 

cccctgtctg ccatgcgagt cgagccgaac ccccgcaggg gctgcagcaa cgctaggtcg 8820 

gccacgaccc tctcggccag cacggcctgt tggatctgcg tgagggtggt ctggaagtcg 8880 

tccaggtcca cgaagcggtg ataggccccc gtgttgatgg tgtaggtgca gttggccatg 8940 

acggaccagt tgacgacttg catgccgggt tgggtgatct ccgtgtactt gaggcgcgag 9000 

taggcgcggg actcgaacac gtagtcgttg catgtgcgta ccagatactg gtagccaacc 9060 

aggaagtggg gaggcggttc tcggtacagg ggccagccga ctgtggcggg ggcgccgggg 9120 

gacaggtcgt ccagcatgag gcgatggtag tggtagatgt agcgggagag ccaggtgatg 9180 

ccggccgagg tggtcgcggc cctggtgaat tcgcggacgc ggttccagat gttgcgcagg 9240 

gggcgaaagc gctccatggt gggcacgctc tgccccgtga ggcgggcgca atcttgtacg 9300 

ctctagatgg aaaaaagaca gggcggtcat cgactccctt ccgtagctcg gggggtaaag 9360 

tcgcaagggt gcggcggcgg ggaaccccgg ttcgagaccg gccggatccg ccgctcccga 9420 

tgcgcctggc cccgcatcca cgacgtccgc gtcgagaccc agccgcgacg ctccgcccca 9480 

atacggaggg gagtcttttg gtgttttttc gtagatgcat ccggtgctgc ggcagatgcg 9540 

acctcagacg cccaccacca ccgccgcggc ggcagtaaac ctgagcggag gcggtgacag 9600 

ggaggaggag gagctggctt tagacctgga agagggagag gggctggccc ggctgggagc 9660 

gccgtcccca gagagacacc ctagggttca gctcgtgagg gacgccaggc aggcttttgt 9720 

gccgaagcag aacctgttta gggaccgcag cggtcaggag gcggaggaga tgcgcgattg 9780 

caggtttcgg gcgggtagag agctgagggc gggcttcgat cgggagcggc tcctgagggc 9840 

ggaggatttc gagcccgacg agcgttctgg ggtgagcccg gcccgcgctc acgtctcggc 9900 

ggccaacctg gtgagcgcgt acgagcagac ggtgaacgag gagcgcaact tccaaaagag 9960 

ctttaacaat cacgtgagga ccctgatcgc gagggaggag gtgaccatcg ggctgatgca 10020 

tctgtgggac ttcgtggagg cctacgtgca gaacccggcc agcaaacctc tgacggccca 10080 
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gctgttcctg 


atcgtgcagc 


acagccgcga 


caacgagacg 


ttccgcgacg 


ccatgttgaa 


10140 


catcgcggag 


cccgagggtc 


gctggctctt 


ggatctgatt 


aacatcctgc 


agagcatcgt 


10200 


ggtgcaggag 


aggggcctca 


gcttagcgga 


caaggtggcg 


gccattaact 


attcgatgca 


10260 


gagcctgggg 


aagttctacg 


ctcgcaagat 


ctacaagagc 


ccttacgtgc 


ccatagacaa 


10320 


ggaggtgaag 


atagacagct 


tttacatgcg 


catggcgctg 


aaggtgctga 


cgctgagcga 


10380 


cgacctcggc 


gtgtaccgta 


acgacaagat 


ccacaaggcg 


gtgagcgcca 


gccgccggcg 


10440 


ggagctgagc 


gacagggagc 


tgatgcacag 


cctgcagagg 


gcgctggcgg 


gcgccgggga 


10500 


cgaggagcgc 


gaggcttact 


tcgacatggg 


agccgatctg 


cagtggcgtc 


ccagcgcgcg 


10560 


cgccttggag 


gcggcgggct 


accccgacga 


ggaggatcgg 


gacgatttgg 


aggaggcagg 


10620 


cgagtacgag 


gacgaagcct 


gaccgggcag 


gtgttgtttt 


agatgcagcg 


gccggcggac 


10680 


ggggccaccg 


cggatcccgc 


acttttggca 


tccatgcaga 


gtcaaccttc 


gggcgtgacc 


10740 


gcctccgatg 


actgggcggc 


ggccatggac 


cgcattatgg 


cgctgactac 


ccgcaacccc 


10800 


gaggctttta 


gacagcaacc 


ccaggccaac 


cgtttttcgg 


ccatcttgga 


agcggtggtg 


10860 


ccctcccgca 


ccaaccccac 


acacgagaaa 


gtcctgacta 


tcgtgaacgc 


cctggtagac 


10920 


agcaaggcca 


tccgccgcga 


cgaggcgggc 


ttgatttaca 


acgctctgct 


ggaacgggtg 


10980 


gcgcgctaca 


acagcactaa 


cgttcagacc 


aatctggatc 


gcctcaccac 


cgacgtgaag 


11040 


gaggcgctgg 


ctcagaagga 


gcggtttctg 


agggacagca 


atctgggctc 


tctggtggca 


11100 


ctcaacgcct 


tcctgagcac 


gcagccggcc 


aacgtgcccc 


gcgggcagga 


ggactacgtg 


11160 


agcttcatca 


gcgctctgag 


gctgctggtg 


tccgaggtgc 


cccagagcga 


ggtgtatcag 


11220 


tctgggccgg 


attacttctt 


ccagacgtcc 


cgacagggct 


tgcaaacggt 


gaacctgact 


11280 


caggccttta 


aaaacttgca 


aggcatgtgg 


ggcgttaagg 


ccccggtggg 


cgatcgagcc 


11340 


accatctcca 


gtctgctgac 


ccccaacact 


cgcctgctgc 


tgctcttgat 


cgcgccgttc 


11400 


accaacagta 


gcactatcag 


ccgtgactcg 


tacctgggtc 


atctcatcac 


tttgtaccgc 


11460 


gaggccatcg 


gtcaggctca 


gatcgacgag 


cacacatatc 


aggagatcac 


taacgtgagc 


11520 


cgggccctgg 


gtcaggaaga 


taccggcagc 


ctggaagcca 


cgttgaactt 


tttgctaacc 


11580 


aaccggaggc 


aaaaaatacc 


ctcccagttt 


acgttaagcg 


ccgaggagga 


gaggattctg 


11640 


cgatacgtgc 


agcagtccgt 


gagtctgtac 


ttgatgcggg 


agggcgccac 


cgcttccacg 


11700 


gctttagaca 


tgacggctcg 


gaacatggaa 


ccgtcctttt 


actccgccca 


ccggccgttc 


11760 
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attaaccgtc 


tgatggacta 


ctt ccatcgc 


gcggccgcca 


ugaaegggga 


guacuucacc 


1 1 pon 


aatgccatcc 


ugaa uccgca 


uugga tgece 










ctgcccgaag 


ccgacgacgg 


<-*4-4-4- /~-4-4-4- rtrr 


gacgacgugu 


ccgacagcau 


uuueacgccg 


1 1 q a n 

± J. 7 ft U 


ggcaatcgcc 


gauuccagaa 


gaaggagggc 


ggagacgagc 


u ccc ecu cue 




J.ZUUU 


gcggcctcta 


ggggagagag 


tcccttt ccc 


agucugucuu 


ccgccagcag 


ugguegggua 




acgcgcccgc 


ggttgccggg 


ggagagegae 


tacctgaacg 


accccuugcu 


gcggccggcu 


1Z1ZU 


aggaagaaaa 


atttccccaa 


caaeggggtg 


gaaagcuugg 


uggauaaaau 


gaaucguugg 


i 91 fi n 

±^ laU 


aagacctacg 


cccaggagca 


gcgggagtgg 


gaggacagtc 


agccgcgacc 


gcugguuccg 


i 99 a n 


ccgcactggc 


gt eg tea gag 


agaagacccg 


gacgacuccg 


cagacgauag 


4~ a i*r ^ j^r 4™ 4~ 4~ 

uageguguug 


i 9^nn 


gacctgggag 


ggageggage 


caaccccttt 


gctcacttgc 


aacccaaggg 


gcgttccagt 


12360 


cgcctctact 


aataaaaaag 


aegeggaaac 


ttaccagagc 


catggccaca 


gcgtgtgtcc 


12420 


tttcttcctc 


tctttcttcc 


tcggcgcggc 


aga atg aga aga gcg gtg aga gtc 


12474 



Met Arg Arg Ala Val Arg Val 
1 5 

acg ccg gcg gcg tat gag ggt ccg ccc cct tct tac gaa age gtg atg 12522 
Thr Pro Ala Ala Tyr Glu Gly Pro Pro Pro Ser Tyr Glu Ser Val Met 
10 15 20 

gga tea gcg aac gtg ccg gec acg ctg gag gcg cct tac gtt cct ccc 12570 
Gly Ser Ala Asn Val Pro Ala Thr Leu Glu Ala Pro Tyr Val Pro Pro 
25 30 35 

aga tac ctg gga cct acg gag ggc aga aac age ate cgt tac tec gag 12618 
Arg Tyr Leu Gly Pro Thr Glu Gly Arg Asn Ser lie Arg Tyr Ser Glu 
40 ^ 45 ^50 ~ 55 

ctg gca ccc ctg tac gat ace acc aag gtg tac ctg gtg gac aac aag 12666 
Leu Ala Pro Leu Tyr Asp Thr Thr Lys Val Tyr Leu Val Asp Asn Lys 
60 65 70 

teg gcg gac ate gee tec ctg aat tat caa aac gat cac age aat ttt 12714 
Ser Ala Asp lie Ala Ser Leu Asn Tyr Gin Asn Asp His Ser Asn Phe 
75 80 ^ 85 

ctg act acc gtg gtg cag aac aat gac ttc acc ccg acg gag gcg ggc 12762 
Leu Thr Thr Val Val Gin Asn Asn Asp Phe Thr Pro Thr Glu Ala Gly 
90 95 100 

acg cag acc att aac ttt gac gag cgt tec cgc tgg ggc ggt cag ctg 12810 
Thr Gin Thr lie Asn Phe Asp Glu Arg Ser Arg Trp Gly Gly Gin Leu 
105 110 115 

aaa acc ate ctg cac acc aac atg ccc aac ate aac gag ttc atg tec 12858 
Lys Thr lie Leu His Thr Asn Met Pro Asn lie Asn Glu Phe Met Ser 
120 125 130 135 
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acc aac aag ttc agg gcc agg ctg atg gtt aaa aag get gaa aac cag 12906 
Thr Asn Lys Phe Arg Ala Arg Leu Met Val Lys Lys Ala Glu Asn Gin 
140 145 150 

cct ccc gag tac gaa tgg ttt gag ttc acc att ccc gag ggc aac tat 12954 
Pro Pro Glu Tyr Glu Trp Phe Glu Phe Thr lie Pro Glu Gly Asn Tyr 
155 160 165 

tec gag acc atg act ate gat ctg atg aac aat gcg ate gtg gac aat 13002 
Ser Glu Thr Met Thr lie Asp Leu Met Asn Asn Ala lie Val Asp Asn 
170 175 180 

tac ctg caa gtg ggg agg cag aac ggg gta ttg gaa age gat ate ggc 13050 
Tyr Leu Gin Val Gly Arg Gin Asn Gly Val Leu Glu Ser Asp lie Gly 
185 190 195 

gta aaa ttt gat acc aga aac ttc cga ctg ggg tgg gat ccc gtg acc 13098 
Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro Val Thr 
200 205 210 ~ 215 

aag ctg gtg atg cca ggc gtg tac acc aac gag get ttt cac ccc gac 13146 
Lys Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His Pro Asp 
220 225 230 

ate gtg ctg ctg ccg ggg tgc ggt gtg gac ttc act cag age cgt ttg 13194 
lie Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Gin Ser Arg Leu 
235 240 245 

agt aac ctg tta ggg ate aga aag cgc cgc ccc ttc caa gag ggc ttt 13242 
Ser Asn Leu Leu Gly lie Arg Lys Arg Arg Pro Phe Gin Glu Gly Phe 
250 255 260 

cag ate atg tat gag gac ctg gaa gga ggt aac att cca ggt ttg eta 13290 
Gin lie Met Tyr Glu Asp Leu Glu Gly Gly Asn lie Pro Gly Leu Leu 
265 270 275 

gac gtg ccg gcg tat gaa gag agt gtt aaa cag gcg gag gcg cag gga 13338 
Asp Val Pro Ala Tyr Glu Glu Ser Val Lys Gin Ala Glu Ala Gin Gly 
280 285 290 295 

cga gag att cga ggc gac acc ttt gcc acg gaa cct cac gaa ctg gta 1338 6 
Arg Glu He Arg Gly Asp Thr Phe Ala Thr Glu Pro His Glu Leu Val 
300 305 310 

ata aaa cct ctg gaa caa gac agt aaa aaa egg agt tac aac att ata 13434 
He Lys Pro Leu Glu Gin Asp Ser Lys Lys Arg Ser Tyr Asn He He 
315 320 325 

tec ggc act atg aat acc ttg tac egg age tgg ttt ctg get tac aac 13482 
Ser Gly Thr Met Asn Thr Leu Tyr Arg Ser Trp Phe Leu Ala Tyr Asn 
330 335 * 340 

tac ggg gat ccc gaa aag gga gtg aga tea tgg acc ata etc acc acc 13530 
Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp Thr He Leu Thr Thr 
345 350 " 355 
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acg gac gtg acc tgc ggc teg cag caa gtg tac tgg tec ctg ccg gat 13578 
Thr Asp Val Thr Cys Gly Ser Gin Gin Val Tyr Trp Ser Leu Pro Asp 
360 365 370 375 

atg atg caa gac ccg gtc acc ttc cgc ccc tec acc caa gtc age aac 13626 
Met Met Gin Asp Pro Val Thr Phe Arg Pro Ser Thr Gin Val Ser Asn 
380 385 390 

ttc ccg gtg gtg ggc acc gag ctg ctg ccc gtc cat gcc aag age ttc 13674 
Phe Pro Val Val Gly Thr Glu Leu Leu Pro Val His Ala Lys Ser Phe 
395 400 405 

tac aac gaa cag gcc gtc tac teg caa etc att cgc cag tec acc gcg 13722 
Tyr Asn Glu Gin Ala Val Tyr Ser Gin Leu He Arg Gin Ser Thr Ala 
410 415 420 

ctt acc cac gtg ttc aat cgc ttt ccc gag aac cag att ctg gtg cgc 13770 
Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn Gin He Leu Val Arg 
425 430 435 

cct ccc get cct acc att acc acc gtc agt gaa aac gtt ccc gcc etc 13818 
Pro Pro Ala Pro Thr He Thr Thr Val Ser Glu Asn Val Pro Ala Leu 
440 445 450 455 

aca gat cac gga acc ctg ccg ctg cgc age agt ate agt gga gtt cag 13866 
Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser He Ser Gly Val Gin 
460 465 ~ 470 

cgc gtg acc ate acc gac gcc aga cgt cga acc tgt ccc tac gtt tac 13914 
Arg Val Thr He Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val Tyr 
475 480 485 

aaa get ctt ggc gta gtg get cct aaa gtg etc tct agt cgc acc ttc 13962 
Lys Ala Leu Gly Val Val Ala Pro Lys Val Leu Ser Ser Arg Thr Phe 
490 495 500 

taa acatgtccat cctcatctct cccgataaca acaccggctg gggactgggc 14015 

teeggcaaga tgtaeggegg agecaaaagg cgctccagtc agcacccagt tegagttegg 14075 

ggccacttcc gtgctccctg gggagcttac aagegaggae tctcgggccg aacggcggta 14135 

gacgatacca tagatgeegt gattgecgae gcccgccggt acaaccccgg accggtcgct 14195 

agcgccgcct ccaccgtgga ttccgtgatc gacagcgtgg tagctggege tegggectat 14255 

gctcgccgca agaggegget geateggaga cgtcgcccca ccgccgccat gctggcagcc 14315 

agggccgtgc tgaggeggge ccggagggta ggcagaaggg ctatgcgccg cgctgccgcc 14375 

aacgccgccg cegggaggge ccgccgacag gctgcccgcc aggctgetge cgccatcgct 14435 

ageatggeca gacccaggag agggaaegtg tactgggtgc gcgattctgt gaegggagtc 14495 

egagtgeegg tgcgcagccg acctccccga agttagaaga tccaagctgc gaagacggcg 14555 

gtactgagtc tccctgttgt tatcagccca acatgagcaa gcgcaagttt aaagaagaac 14615 
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tgctgcagac 


gctggtgcct 


gagatctatg 


gccctccgga 


cgtgaagcct 


gacattaagc 


14675 


cccgcgatat 


caagcgtgtt 


aaaaagcggg 


aaaagaaaga 


ggaactcgcg 


gtggtagacg 


14735 


atggcggagt 


ggaatttatt 


aggagtttcg 


ccccgcgacg 


cagggttcaa 


tggaaagggc 


14795 


ggcgggtaca 


acgcgttttg 


aggccgggca 


ccgcggtagt 


ttttaccccg 


ggagagcggt 


14855 


cggccgttag 


gggtttcaaa 


aggcagtacg 


acgaggtgta 


cggcgacgag 


gacatattgg 


14915 


aacaggcggc 


tcaacagatc 


ggagaatttg 


cctacggaaa 


gcgttcgcgt 


cgcgaagacc 


14975 


tggccatcgc 


tttagacagc 


ggcaacccca 


cgcccagcct 


caaacctgtg 


acgctgcagc 


15035 


aggtgctccc 


cgtgagcgcc 


agcacggaca 


gcaagagggg 


aataaaaaga 


gaaatggaag 


15095 


atctgcagcc 


caccatccag 


ctcatggtcc 


ctaaacggca 


gaggctggaa 


gaggtcctgg 


15155 


agaaaatgaa 


agtggaccca 


agcatagagc 


cggacgtcaa 


agtcaggccg 


atcaaagaag 


15215 


tggcccctgg 


tctcggggtg 


cagacggtgg 


atatccagat 


ccccgtcacg 


tcagcttcga 


15275 


ccgccgtgga 


agccatggaa 


acgcaaacgg 


aaacccctgc 


cgcgatcggt 


accagggaag 


15335 


tggcgttgca 


aaccgacccc 


tggtacgaat 


acgccgcccc 


tcggcgtcag 


aggcgacccg 


15395 


ctcgttacgg 


ccccgccaac 


gccatcatgc 


cagaatatgc 


gctgcatccg 


tctatcctgc 


15455 


ccacccccgg 


ctaccgggga 


gtgacgtatc 


gcccgtcagg 


aacccgccgc 


cgaacccgtc 


15515 


gccgccgccg 


ctcccgtcgt 


gctctggccc 


ccgtgtcggt 


gcgccgcgta 


acacgccggg 


15575 


gaaagacagt 


taccattccc 


aacccgcgct 


accaccctag 


catcctttaa 


tgactctgcc 


15635 


gttttgcaga 


tggctctgac 


ttgccgcgtg 


cgccttcccg 


ttccgcacta 


tcgaggaaga 


15695 


tctcgtcgta 


ggagaggcat 


ggcgggtagt 


ggtcgccggc 


gggctttgcg 


caggcgcatg 


15755 


aaaggcggaa 


ttttacccgc 


tctgataccc 


ataatcgccg 


ccgccatcgg 


tgccataccc 


15815 


ggcgtcgctt 


cagtggcctt 


gcaagcagct 


cgtaataaat 


aaacgaaggc 


ttttgcactt 


15875 


atgtcctggt 


cctgactatt 


ttatgcagaa 


agagcatgga 


agacatcaat 


tttacgtcgc 


15935 


tggctccgcg 


gcacggctcg 


cggccgctca 


tgggcacctg 


gaacgacatc 


ggcaccagtc 


15995 


agctcaacgg 


gggcgctttc 


aattggggga 


gcctttggag 


cggcattaaa 


aactttggct 


16055 


ccacgattaa 


atcctacggc 


agcaaagcct 


ggaacagtag 


tgctggtcag 


atgctccgag 


16115 


ataaactgaa 


ggacaccaac 


ttccaagaaa 


aagtggtcaa 


tggggtggtg 


accggcatcc 


16175 


acggcgcggt 


agatctcgcc 


aaccaagcgg 


tgcagaaaga 


gattgacagg 


cgtttggaaa 


16235 


gctcgcgggt 


gccgccgcag 


agaggggatg 


aggtggaggt 


cgaggaagta 


gaagtagagg 


16295 



125 



WO 03/046124 



PCT/US02/33645 



aaaagctgcc 


cccgctggag 


aaagttcccg 


gtgcgcctcc 


gagacegcag 


aagcgaccca 


16355 


ggccagaact 


agaagaaact 


ctggtgacgg 


agagcaagga 


gcctccctcg 


tacgagcaag 


16415 


ccttgaaaga 


gggcgcctct 


ccaccctacc 


caatgacaaa 


accgatcgcg 


cctatggctc 


16475 


ggccggtgta 


cgggaaggac 


tacaagcctg 


teaegctaga 


gctccccccg 


ccgccaccgc 


16535 


cgccccccac 


gcgcccgacc 


gttccccccc 


ccctgccggc 


tccgtcggcg 


gqacccatat 


16595 


ccgcacccgt 


cgccgtgcct 


ctgccagccg 


cccgcccagt 


ggccgtggcc 


actcfccaaaa 


16655 


accccagagg 


ccagagagga 


gccaactggc 


aaagcacget 


gaacagcatc 


gtgggcctgg 


16715 


gagtgaaaag 


cctgaaacgc 


cgccgttgct 


attattaaaa 


gtgtagctaa 


aaaatttccc 


16775 


gttgtatacg 


cctcctatgt 


taccgccaga 


gaegegtgae 


tgtcgccgcg 


agcgccgctt 


16835 


tcaag atg gcc acc cca 
Met Ala Thr Pro 
505 


teg atg atg 
Ser Met Met 
510 


ccg cag tgg tct tac atg cac ate 
Pro Gin Trp Ser Tyr Met His He 
515 


16885 



gcc ggg cag gac gcc teg gag tac ctg age ccc ggt etc gtg cag ttc 16933 
Ala Gly Gin Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gin Phe 
520 525 530 

gcc cgc gcc acc gac acc tac ttc age ttg gga aac aag ttt aga aac 16981 
Ala Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn 
535 540 545 550 

ccc acc gtg gcc ccc acc cac gat gta acc acg gac cgc teg caa agg 17029 
Pro Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gin Arg 
555 560 " 565 

ctg acc ctg cgt ttt gtg ccc gta gac egg gag gac acc gcg tac tct 17077 
Leu Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Thr Ala Tyr Ser 
570 575 J 580 

tac aaa gtg cgc tac acg ctg gcc gta ggg gac aac cga gtg ctg gac 17125 
Tyr Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp 
585 590 595 

atg gcc age acc tac ttt gac ate egg gga gtg ctg gat cgc ggt ccc 17173 
Met Ala Ser Thr Tyr Phe Asp He Arg Gly Val Leu Asp Arg Gly Pro 
600 605 610 

agt ttt aag ccc tac teg ggt acc gcg tac aat tec ctg get ccc aag 17221 
Ser Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys 
615 620 625 630 

ggc get ccc aac cct gca gaa tgg acg aat tea gac age aaa gtt aaa 17269 
Gly Ala Pro Asn Pro Ala Glu Trp Thr Asn Ser Asp Ser Lys Val Lys 
635 640 645 

gtg agg gca cag gcg cct ttt gtt age teg tat ggt get aca gcg att 17317 
Val Arg Ala Gin Ala Pro Phe Val Ser Ser Tyr Gly Ala Thr Ala He 
650 655 " 660 
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aca aaa gag ggt att cag gtg gga gta acc tta aca gac tec gga tea 

Thr Lys Glu Gly He Gin Val Gly Val Thr Leu Thr Asp Ser Gly Ser 

665 670 675 

aca cca cag tat gca gat aaa acg tat cag cct gag ccg caa att gga 

Thr Pro Gin Tyr Ala Asp Lys Thr Tyr Gin Pro Glu Pro Gin He Gly 

680 685 690 



aga gtg eta aag aaa aca acg ccc atg ttc cct tgt tac ggc tea tat 
Arg Val Leu Lys Lys Thr Thr Pro Met Phe Pro Cys Tyr Gly Ser Tyr 
715 720 ~ 725 



caa gac gtg caa aat ccc gaa tta caa ttt ttt gee tct act aat gtc 
Gin Asp Val Gin Asn Pro Glu Leu Gin Phe Phe Ala Ser Thr Asn Val 
745 750 755 



17365 



17413 



gaa eta cag tgg aac age gat gtt gga acc gat gac aaa ata gca gga 17461 
Glu Leu Gin Trp Asn Ser Asp Val Gly Thr Asp Asp Lys He Ala Gly 
6 95 700 705 710 



17509 



gee agg ccc act aat gaa aaa gga gga cag gca aca ccg tec get agt 17557 
Ala Arg Pro Thr Asn Glu Lys Gly Gly Gin Ala Thr Pro Ser Ala Ser 
730 735 740 



17605 



gee aat aca cca aaa gca gtt eta tat gcg gag gac gtg tea att gaa 17653 
Ala Asn Thr Pro Lys Ala Val Leu Tyr Ala Glu Asp Val Ser He Glu 
760 765 770 

gcg cca gac act cac ttg gtg ttc aaa cca aca gtc act gaa ggc att 17701 
Ala Pro Asp Thr His Leu Val Phe Lys Pro Thr Val Thr Glu Gly He 
775 780 785 790 

aca agt tea gag get eta ctg acc caa caa get get ccc aac cgt cca 17749 
Thr Ser Ser Glu Ala Leu Leu Thr Gin Gin Ala Ala Pro Asn Arg Pro 
795 800 805 

aac tac ata gee ttt aga gat aat ttt att ggt etc atg tac tac aat 17797 
Asn Tyr He Ala Phe Arg Asp Asn Phe He Gly Leu Met Tyr Tyr Asn 
810 815 ~ 820 

age aca ggt aac atg gga gta ctg gca ggc cag get tct cag eta aat 17845 
Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gin Ala Ser Gin Leu Asn 
825 830 835 

gca gtt gtt gac ctg caa gac aga aat act gag ctg tec tac caa etc 17893 
Ala Val Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin Leu 
840 845 850 

atg ttg gac gee etc gga gac cgc agt egg tac ttt tct atg tgg aac 17941 
Met Leu Asp Ala Leu Gly Asp Arg Ser Arg Tyr Phe Ser Met Trp Asn 
855 860 865 870 

caa get gtg gat agt tac gat cct gat gta aga ate ata gaa aac cat 17989 
Gin Ala Val Asp Ser Tyr Asp Pro Asp Val Arg He He Glu Asn His 
875 880 885 
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ggc gta gaa gat gaa ttg cct aat tat tgc ttt cct ttg gga ggc atg 
Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Gly Gly Met 
890 895 900 

gca gta acc gac acc tac teg cct ata aag gtt aat gga gga ggc aat 
Ala Val Thr Asp Thr Tyr Ser Pro He Lys Val Asn Gly Gly Gly Asn 
905 910 915 

gga tgg gaa gcc aat aac ggc gtt ttc acc gaa aga gga gtg gaa ata 
Gly Trp Glu Ala Asn Asn Gly Val Phe Thr Glu Arg Gly Val Glu He 
920 925 930 

ggt tea ggg aac atg ttt gcc atg gag att aac ctg caa gcc aac eta 
Gly Ser Gly Asn Met Phe Ala Met Glu He Asn Leu Gin Ala Asn Leu 
935 940 945 950 

tgg cgt age ttt ctg tac tec aat att ggg ctg tac ctg cca gac tct 
Trp Arg Ser Phe Leu Tyr Ser Asn He Gly Leu Tyr Leu Pro Asp Ser 
955 960 965 

etc aaa ate act cct gac aac ate aca etc cca gag aac aaa aac acc 
Leu Lys He Thr Pro Asp Asn He Thr Leu Pro Glu Asn Lys Asn Thr 
970 975 980 

tat cag tat atg aac ggt cgc gtg acg cca ccc ggg ctg gtt gac acc 
Tyr Gin Tyr Met Asn Gly Arg Val Thr Pro Pro Gly Leu Val Asp Thr 
985 990 995 

tac gtt aac gtg ggc gcg cgc tgg tec ccc gat gtc atg gac agt 
Tyr Val Asn Val Gly Ala Arg Trp Ser Pro Asp Val Met Asp Ser 
1000 1005 1010 

att aac cct ttt aat cac cac cgc aac gcc gga etc cgc tac cgt 
He Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg 
1015 1020 1025 

tec atg etc ctg gga aac gga cgc tac gtg ccc ttc cac ate cag 
Ser Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His He Gin 
1030 1035 1040 

gtg ccc cag aaa ttc ttt gca att aaa aac ctg ctg ctg etc ccc 
Val Pro Gin Lys Phe Phe Ala lie Lys Asn Leu Leu Leu Leu Pro 
1045 1050 1055 

ggt tec tac acc tac gag tgg aac ttc cgc aag gac gtg aac atg 
Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met 
1060 1065 1070 

ate ttg cag age teg ctg ggc aat gac ctg cga gtg gac ggg gcc 
He Leu Gin Ser Ser Leu Gly Asn Asp Leu Arg Val Asp Gly Ala 
1075 1080 1085 



18037 



18085 



18133 



18181 



18229 



18277 



18325 



18370 



18415 



18460 



18505 



18550 



18595 



age ate cgc ttc gac age ate aac ctg tac gcc aac 
Ser He Arg Phe Asp Ser He Asn Leu Tyr Ala Asn 
1090 1095 HOO 



ttt ttc ccc 
Phe Phe Pro 



18640 
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atg gcc cac aac acg gcc tec acc ctg gaa gec atg 
Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met 
1105 mo 1115 



ctg cgc aac 
Leu Arg Asn 



18685 



gac acc aac gac caa tct ttc aac gac tac ctg tgc gcg gcc aac 
Asp Thr Asn Asp Gin Ser Phe Asn Asp Tyr Leu Cys Ala Ala Asn 
1120 1125 1130 



18730 



atg ctg tac ccc ate ccc gcc aac gcc acc age gtg ccc ate tec 
Met Leu Tyr Pro lie Pro Ala Asn Ala Thr Ser Val Pro lie Ser 
1135 1140 1145 



18775 



att ccc tct cgc aac tgg gca gcc ttc agg ggc tgg agt ttc acc 18820 

lie Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr 

1150 1155 1160 

cgc etc aaa acc aag gag acc ccc teg ctg ggc tec ggg ttc gac 18865 

Arg Leu Lys Thr Lys Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp 

1165 1170 " 1175 



ccc tac ttc gtc tac tec ggc tec ate ccc tac ctg gac ggc acc 
Pro Tyr Phe Val Tyr Ser Gly Ser lie Pro Tyr Leu Asp Gly Thr 
1180 1185 1190 



18910 



ttc tac etc aac cat act ttc aaa aag gtg tea ate atg ttc gac 
Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser lie Met Phe Asp 
1195 1200 1205 



18955 



tec tec gtc age tgg ccc ggc aac gac cgt ctg ctg 
Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu 
1210 1215 1220 



acg ccc aac 
Thr Pro Asn 



19000 



gag ttc gaa ate aag cgt teg gtg gac ggt gaa ggg 
Glu Phe Glu lie Lys Arg Ser Val Asp Gly Glu Gly 
1225 1230 " 1235 



tac aac gtg 
Tyr Asn Val 



19045 



get cag age aac atg acc aag gac tgg ttc ctg att cag atg etc 
Ala Gin Ser Asn Met Thr Lys Asp Trp Phe Leu He Gin Met Leu 
1240 1245 1250 



19090 



age cac tac aac ate ggc tac cag ggc ttc tac gtg ccc gaa aat 
Ser His Tyr Asn He Gly Tyr Gin Gly Phe Tyr Val Pro Glu Asn 
1255 1260 1265 



19135 



tac aag gac cgc atg tac tct ttc ttc aga aac ttc caa ccc atg 19180 

Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gin Pro Met 
1270 1275 1280 

age cgc caa att gta gat tea acg get tac act aat tat cag gat 19225 

Ser Arg Gin He Val Asp Ser Thr Ala Tyr Thr Asn Tyr Gin Asp 
1285 1290 1295 



gtg aaa ctg cca tac cag cat aac aac tea ggg ttc gtg ggc tac 
Val Lys Leu Pro Tyr Gin His Asn Asn Ser Gly Phe Val Gly Tyr 
1300 1305 ^ 1310 



19270 
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atg gga ccc acc atg cga gag ggg cag gcc tac ccg gcc aac tat 19315 
Met Gly Pro Thr Met Arg Glu Gly Gin Ala Tyr Pro Ala Asn Tyr 
1315 1320 1325 

ccc tat ccc ctg att ggg gcc acc gcc gtg ccc age etc acg cag 19360 
Pro Tyr Pro Leu lie Gly Ala Thr Ala Val Pro Ser Leu Thr Gin 
1330 1335 1340 

aaa aag ttc etc tgc gac egg gtg atg tgg agg ate ccc ttc tct 19405 
Lys Lys Phe Leu Cys Asp Arg Val Met Trp Arg lie Pro Phe Ser 
1345 1350 1355 

age aac ttc atg tct atg ggc tec etc acc gac ctg ggg cag aac 19450 
Ser Asn Phe Met Ser Met Gly Ser Leu Thr Asp Leu Gly Gin Asn 
1360 1365 1370 

atg ctg tac gcc aac tec get cac gcc ttg gat atg acc ttt gag 19495 
Met Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met Thr Phe Glu 
1375 1380 1385 

gtg gat ccc atg gat gag ccc acg ctt etc tat gtt ctg ttt gaa 19540 
Val Asp Pro Met Asp Glu Pro Thr Leu Leu Tyr Val Leu Phe Glu 
1390 1395 1400 

gtc ttc gac gtg gtg cgc ate cac cag ccg cac cgc ggc gtc ate 19585 
Val Phe Asp Val Val Arg He His Gin Pro His Arg Gly Val He 
1405 1410 1415 

gag gcc gtc tac ctg cgc aca cct ttc tct gcc ggt aac gcc acc 19630 
Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr 
1420 ~ 1425 1430 

acc taa agaagecgat gggctccagc gaacaggagc tgeaggecat tgttcgcgac 19686 
Thr 



ctgggctgcg 


ggccctactt 


tttgggcacc 


ttcgacaagc 


gttttccegg 


cttcatgtcc 


19746 


ccccacaagc 


cggcctgtgc 


categttaac 


acggccggac 


gggagacegg 


gggggtccac 


19806 


tggctcgcct 


tcgcctggaa 


cccgcgtaac 


cgcacctgct 


acctgttcga 


cccttttggt 


19866 


ttctccgacg 


aaaggctgaa 


gcagatctac 


cagttcgagt 


acgaggggct 


cctcaagcgc 


19926 


agegctctgg 


cctccacgcc 


cgaccactgc 


gtcaccctgg 


aaaagtccac 


ccaaacggtc 


19986 


caggggcccc 


tctcggccgc 


ctgcgggctc 


ttctgttgca 


tgtttttgca 


cgccttcgtg 


20046 


cactggcctc 


acacccccat 


ggatcacaac 


cccaccatgg 


atctgctcac 


cggagtgccc 


20106 


aacagcatgc 


ttcacagccc 


ccaggtcgcc 


cccaccctgc 


gccgtaacca 


ggaacacctg 


20166 


tategcttte 


tggggaaaca 


ctctgcctat 


tttcgccgcc 


accggcagcg 


catcgaaegg 


20226 


gccacggcct 


tcgaaagcat 


gagecaaaga 


gtgtaatcaa 


taaaaaacat 


ttttatttga 


20286 


catgataege 


gcttctggcg 


ttttattaaa 


aatcgaaggg 


ttcgagggag 


gggtcctcgt 


20346 
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gcccgctggg gagggacacg ttgcgatact ggaaacgggc gctccaacga aactcgggga 20406 

tcaccagccg cggcaggggc acgtcttcta ggttctgctt ccaaaactgc cgcaccagct 20466 

gcagggctcc catgacgtcg ggcgccgata tcttgaagtc gcagttaggg ccggagctcc 20526 

cgcggctgtt gcggaacacg gggttggcac actggaacac cagcacgccg gggttgtgga 20586 

tactggccag ggccgtcggg tcggtcacct ccgacgcatc cagatcctcg gcgttgctca 20646 

gggcaaacgg ggtcagcttg cacatctgcc gcccaatctg • gggtactagg tcgcgcttgt 20706 

tgaggcagtc gcagcgcaga gggatcagga tgcgtcgctg cccgcgttgc atgatagggt 20766 

aactcgccgc caggaactcc tccatttgac ggaaggccat ctgggctttg ccgccctcgg 20826 

tgtagaatag cccgcaggac ttgctagaga atacgttatg accgcagttg acgtcctccg 20886 

cgcagcagcg ggcgtcttcg ttcttcagct gaaccacgtt gcggccccaa cggttctgga 20946 

ccaccttggc tctagtgggg tgctccttca gcgcccgctg tccgttctcg ctggttacat 21006 

ccatttccaa cacgtgctcc ttgcagacca tctccactcc gtggaagcaa aacaggacgc 21066 

cctcctgctg ggtactgcga tgctcccata cggcgcatcc ggtgggctcc cagctcttgt 21126 

gttttacccc cgcgtaggct tccatgtaag ccataaggaa tctgcccatc agctcggtga 21186 

aggtcttctg gttggtgaag gttagcggca ggccgcggtg ctcctcgttc aaccaagttt 21246 

gacagatctt gcggtacacc gctccctggt cgggcagaaa cttaaaagcc gctctgctgt 21306 

cgttgtctac gtggaacttc tccattaaca tcatcatggt ttccataccc ttctcccacg 21366 

ctgtcaccag tggtttgctg tcggggttct tcaccaacac ggcggtagag gggccctcgc 21426 

cggccccgac gtccttcatg gtcattcttt gaaactccac ggagccgtcc gcgcgacgta 2148 6 

ctctgcgcac cggagggtag ctgaagccca cctccaccac ggtgccttcg ccctcgctgt 21546 

cggagacaat ctccggggat ggcggcggcg cgggtgtcgc cttgcgagcc ttcttcttgg 21606 

gagggagctg aggcgcctcc tgctcgcgct cggggctcat ctcccgcaag tagggggtaa 21666 

tggagctgcc tgcttggttc tgacggttgg ccattgtatc ctaggcagaa agacatggag 21726 

cttatgcgcg aggaaacttt aaccgccccg tcccccgtca gcgacgaaga tgtcatcgtc 21786 

gaacaggacc cgggctacgt tacgccgccc gaggatctgg aggggcctga ccggcgcgac 21846 

gctagtgagc ggcaggaaaa tgagaaagag gaggcctgct acctcctgga aggcgacgtt 21906 

ttgctaaagc atttcgccag gcagagcacc atagttaagg aggccttgca agaccgctcc 21966 

gaggtgccct tggacgtcgc cgcgctctcc caggcctacg aggcgaacct tttctcgcct 22026 
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cgagtgcctc 


cgaagagaca 


gcccaacggc 


acctgcgagc 


ccaacccgcg 


actcaacttc 


22086 


taccccgtgt 


tcgccgtacc 


agaqqcgctg 

3 3315 3 


gccacctatc 


acattttttt 


caaaaaccaa 


22146 


cgcatccccc 


tatcgtgccg 


ggccaaccgc 


accgcggccq 

3 33 3 


ataggaatct 


caggcttaaa 


22206 


aacggagcca 


acatacctga 


tatcacgtcg 


ctqgacraaaa 

*" 3 3 3 3 52 3 


tacccaacrat 


tttcgagggt 


22266 


ctaqatcaacr 


atoaaaaaca 


aacaacaaac 


actctacaaa 


a a era a caaaa 


a era cr a cr t ca rr 


223P6 

£1 £m O £> \J 


aacgtqctqg 


tggaqctqqa 


aacrgqacaac 

3 3 333 ,- * wv -* , -*' v -* 


qcqcqtctcfa 

3 ***3 v "3 


ccgtcct caa 


acactacata 


22386 


gaagtctccc 


acttcgccta 


ccccgccctc 


aacttgccac 


ccaaagttat 


gaaatcggtc 


22446 


atggatcagc 


tgctcatcaa 


gagagctgag 

3 3 3 3 3 


cccctggatc 


ccgaccaccc 


cqaqacaaaa 


22506 


aactcagagg 


acggaaagcc 


cgtcgtcagc 


aacqaaaaac 


t caacrcaata 

3 "3 ^33 ° 3 


octcrcfaaacc 


22566 


agggaccccc 


aacagttgca 


aaaaaaacac 

*■* 3 *•* 3 ** 3 3 w 3 w 


aagatgatga 


taccaoccat 


qefcerert cacc 


22626 


gtqqaqctqq 


aatgcctgca 


acggtttttc 


agcgacgtgg 


aaaccrctaccr 


ca a a a tccrcrej 


22686 


gaatccctgc 


actacacctt 


ccaccaaaac 


tacatccacc 


aacrcct crcaa 


cfatctrraar 


221 AS 


gtggagctca 

3 3 3 3 


gcaacctggt 


ctcctacatg 


ggcatcctcc 


acgagaaccg 


actaaaacaa 


22806 

fa* fa* V V V 


agcgtgctgc 
3 33 3 


actgcacctt 


qcaaqqcqaq 

3 w * -,wl 33 3 fc *3 


qcqcqqcqqq 

3 w 3 3 3 w 3 3 3 


actacgtgcg 


agactgeate 


22866 


tacctcttcc 


tcaccctcac 


ctggcagacc 


accataaaca 


tctggcagca 


gtgcttggaa 


22926 


gagagaaacc 


tcaaagagct 


agacaaactc 


ctctgccgcc 


aacaciccicac 

>H w \A \M \^ u W 


cctcrtcrcrtcc 


22986 


ggtttcagcg 


agcgcacggt 

3 3 ^> 3 


cgccagcgct 


ctggcggaca 


tcatcttccc 


oaacrcoccta 

3 3 a 3 ^y^vuy 


23046 


atgaaaacct 


tgcaaaacgg 


cctgccggat 


ttcatcagtc 


aaagcatttt 


acaaaacttc 

XJ VUU V* V»* W fa* 


23106 


cgctcttttg 


tcctggaacg 


ctccgggatc 


ttgcccgcca 


tgagctgege 


gctaccttct 


23166 


gactttgtcc 


ccctctccta 


ccqcqaqtqc 


cctcccccac 


tgtggagcca 


ctgctacctc 


23226 


ttccaactgg 


ccaactttct 


ggcctaccac 


tccgacctca 


tqqaaqacqt 

i *33 w,,fc *3" ,w 3 ^ 


aaacaaaaaa 


23286 


ggtttactgg 


agtgccactg 


ccgctgcaac 


ctgtgcaccc 


cccacagatc 


actaacctac 

y\*wyyovv.y^ 


23346 


aacaccgagc 


tactcagcga 


aacccaggtc 


ataggtacct 


tcgagat cca 


cjcfacrccccacf 


23406 


cagcaagagg 

3 333 


gtgcttccgg 

3 3 3 3 


cttgaagctc 


actccggcgc 


tgtggacctc 


ggcttactta 


23466 


cgcaaatttg 


taqccaaaaa 

*"**—3 w 3 3 3 ^ 


ctaccacgcc 


cacaaaattc 


agttttacga 


agaccaat ct 


23526 


cgaccaccga 


aagcccccct 


cacggcctgc 


gtcatcaccc 


agagcaagat 


cctggcccaa 


23586 


ttgcaatcca 


tcaaccaagc 


gcgccgcgat 


ttccttttga 


aaaagggtcg 


gggggtgtac 


23646 


ctggaccccc 


agaccggcga 


ggaactcaac 


ccgtccacac 


tctccgtcga 


agcagccccc 


23706 


ccgagacatg 


ccgcccaagg 


gaaccgccaa 


gcagctgatc 


geteggcaga 


gagegaagaa 


23766 
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gcaagagctg ctccagcagc aggtggagga cgaggaagag atgtgggaca gccaggcaga 23826 
ggaggtgtca gaggacgagg aggagatgga aagctgggac agcctagacg aggaggagga 23886 
cgagctttca gaggaagagg cgaccgaaga aaaaccacct gcatccagcg cgccttctct 23946 
gagccgacag ccgaagcccc ggcccccgac gcccccggcc ggctcactca aagccagccg 24006 
taggtgggac gccaccgaat ctccagcggc agcggcaacg gcagcgggta aggccaaacg 24066 
cgagcggcgg gggtattgct cctggcgggc ccacaaaagc agtattgtga actgcttgca 24126 
acactgcggg ggaaacatct cctttgcccg acgctacctc ctcttccatc acggtgtggc 24186 
cttccctcgc aacgttctct attattaccg tcatctctac agcccctacg aaacgctcgg 24246 
agaaaaaagc taaggcctcc tccgccgcga ggaaaaactc cgccgccgct gccgccgcca 24306 
aggatccacc ggccaccgaa gagctgagaa agcgcatctt tcccactctg tatgctatct 24366 
ttcagcaaag ccgcgggcag caccctcagc gcgaactgaa aataaaaaac cgctccttcc 24426 
gctcgctcac ccgcagctgt ctgtaccaca agagagaaga ccagctgcag cgcaccctgg 24486 
acgacgccga agcactgttc agcaaatact gctcagcgtc tcttaaagac taaaagaccc 24546 
gcgctttttc cccctcggcc gccaaaaccc acgtcatcgc cagcatgagc aaggagattc 24606 
ccacccccta catgtggagc tatcagcccc agatgggcct ggccgcgggg gccgcccagg 24666 
actactccag caagatgaac tggctcagcg ccggccccca catgatctca cgagttaacg 24726 
gcatccgagc ccaccgaaac cagattctct tagaacaggc ggcaatcacc gccacacccc 24786 
ggcgccaact caacccgcct agttggcccg ccgcccaggt gtatcaggaa aatccccgcc 24846 
cgaccacagt cctcctgcca cgcgacgcgg aggccgaagt cctcatgact aactctgggg 24906 
tacaattagc gggcgggtcc aggtacgcca ggtacagagg tcgggccgct ccttactctc 24966 
ccgggagtat aaagagggtg atcattcgag gccgaggtat ccagctcaac gacgagacgg 25026 
tgagctcctc aaccggtctc agacctgacg gagtcttcca gctcggagga gcgggccgct 25086 
cttccttcac cactcgccag gcctacctga ccctgcagag ctcttcctcg cagccgcgct 25146 
ccgggggaat cggcactctc cagttcgtgg aagagttcgt tccctccgtc tacttcaacc 25206 
ccttctccgg ctcgcctgga cgctacccgg acgccttcat tcccaacttt gacgcagtga 25266 
gtgaatccgt ggacggctac gactgatgac agatggtgcg gccgtgagag ctcggctgcg 25326 
acatctgcat cactgccgtc agcctcgctg ctacgctcgg gaggcgatcg tcttcagcta 25386 
ctttgagctg ccggacgagc accctcaggg tccggctcac gggttgaaac tcgagatcga 25446 
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gaacgcgctc gagtctcgcc tcatcgacac cttcaccgcc cgacctctcc tggtagaaat 25506 

ccaacggggg atcactacca tcaccctgtt ctgcatctgc cccacgcccg gattacatga 25566 

agatctgtgt tgtcatcttt gcgctcagtt taataaaaac tgaacttttt gccgcacctt 25626 

caacgccatc tgtgatttct acaacaaaaa gttcttctgg caaaggtaca caaactgtat 25686 

tttattctaa ttctacctca tctatcgtgc tgaactgcgc ctgcactaac gaacttatcc 25746 

agtggattgc aaacggtagt gtgtgcaagt acttttgggg gaacgatata gttagtagaa 25806 

ataacagcct ttgcgagcac tgcaactcct ccacactaat cctttatccc ccatttgtta 25866 

ctggatggta tatgtgcgtt ggctccggtt taaatcctag ttgctttcat aagtggtttc 25926 

tacaaaaaga gacccttccc aacaattctg tttctttttt cgccctatcc tactgctgtt 25986 

ctccctctgg ttactctttc aaacctctaa ttggtatttt agctttgata ctcataatct 26046 

ttattaactt tataataatt aacaacttac agtaaacatg cttgttctac tgctcgccac 26106 

atctttcgct ctctctcacg ccagaacaag tattgttggc gcaggttaca atgcaactct 26166 

tcaatctgct tacatgccag attccgacca gataccccat attacgtggt acttacaaac 26226 

ctccaaacct aattcttcat tttatgaagg aaacaaactc tgcgatgact ccgacaacag 26286 

aacgcacaca tttccccacc cttcactaca attcgaatgc gtaaacaaaa gcttgaagct 26346 

ttacaactta aagccttcag attctggctt gtaccatgct gtagttgaaa aaagtaattt 26406 

agaagtccac agtgattaca ttgaattgac ggttgtggac ctgccacctc caaaatgtga 26466 

ggtttcctcc tcttaccttg aagttcaagg cgtggatgcc tactgcctca tacacattaa 26526 

ctgcagcaac tctaaatatc cagctagaat ttactataat ggacaggaaa gtaatctttt 26586 

ttattattta acaacaagcg ctggtaacgg taaacagtta cctgactatt ttactgctgt 26646 

tgttgaattt tccacctaca gagaaacgta tgccaagcgg ccttacaatt tctcataccc 26706 

gtttaacgac ctttgcaatg aaatacaagc gctcgaaact ggaactgatt ttactccaat 26766 

tttcattgct gccattgttg taagcttaat taccattatt gtcagcctag cattttactg 26826 

cttttacaag cccaaaaacc ctaagtttga aaaacttaaa ctaaaacctg tcattcaaca 26886 

agtgtgattt tgttttccag catggtagct gcatttctac ttctcctctg tctacccatc 26946 

attttcgtct cttcaacttt cgccgcagtt tcccacctgg aaccagagtg cctaccgcct 27006 

tttgacgtgt atctgattct cacctttgtt tgttgtatat ccatttgcag tatagcctgc 27066 

ttttttataa caatctttca agccgccgac tatttttacg tgcgaattgc ttactttaga 27126 

caccatcctg aatacagaaa tcaaaacgtt gcctccttac tttgtttggc atgattaagt 27186 



134 



WO 03/046124 



PCT/US02/33645 



tattgctgat acttaattat ttacccctaa tcaactgtaa ttgtccattc accaaaccct 27246 

ggtcattcta cacctgttat gataaaatcc ccgacactcc tgttgcttgg ctttacgcag 27306 

ccaccgccgc tttggtattt atatctactt gccttggagt aaaattgtat tttattttac 27366 

acactgggtg gctacatccc agagaagatt tacctagata tcctcttgta aacgcttttc 27426 

aattacagcc tctgcctcct cctgatcttc ttcctcgagc tccctctatt gtgagctact 27486 

ttcaactcac cggtggagat gactgactct caggacatta atattagtgt ggaaagaata 27546 

gctgctcagc gtcagcgaga aacgcgagtg ttggaatacc tggaactaca gcaacttaaa 27606 

gagtcccact ggtgtgagaa aggagtgctg tgccatgtta agcaggcagc cctttcctac 27666 

gatgtcagcg ttcagggaca tgaactgtct tacactttgc ctttgcagaa acaaaccttc 27726 

tgcaccatga tgggctctac ctccatcaca atcacccaac aagccgggcc tgtagagggg 27786 

gctatcctct gtcactgtca cgcacctgat tgcatgtcca aactaatcaa aactctctgt 27846 

gctttaggtg atatttttaa ggtgtaaatc aataataaac ttaccttaaa tttgacaaca 27906 

aatttctggt gacatcattc agcagcacca ctttaccctc ttcccagctc tcgtatggga 27966 

tgcgatagtg ggtggcaaac ttcctccaaa ccctaaaaga aatattggta tccacttcct 28026 

tgtcctcacc cacaattttc atcttttcat ag atg aaa aga acc aga gtt gat 28079 

Met Lys Arg Thr Arg Val Asp 
1435 1440 

gaa gac ttc aac ccc gtc tac ccc tat gac acc aca acc act cct 28124 
Glu Asp Phe Asn Pro Val Tyr Pro Tyr Asp Thr Thr Thr Thr Pro 
1445 1450 1455 

gca gtt ccc ttt ata tea ccc ccc ttt gta aac age gat ggt ctt 28169 
Ala Val Pro Phe He Ser Pro Pro Phe Val Asn Ser Asp Gly Leu 
14 60 1465 1470 

cag gaa aac ccc cca ggt gtt tta agt ctg cga ata get aaa ccc 28214 
Gin Glu Asn Pro Pro Gly Val Leu Ser Leu Arg He Ala Lys Pro 
1475 1480 1485 

eta tat ttc gac atg gag aga aaa eta gee ctt tea ctt gga aga 28259 
Leu Tyr Phe Asp Met Glu Arg Lys Leu Ala Leu Ser Leu Gly Arg 
1490 1495 1500 

ggg ttg aca att acc gee gee gga caa tta gaa agt acg cag age 28304 
Gly Leu Thr He Thr Ala Ala Gly Gin Leu Glu Ser Thr Gin Ser 
1505 1510 1515 

gta caa acc aac cca ccg ttg ata att acc aac aac aac aca ctg 28349 
Val Gin Thr Asn Pro Pro Leu He He Thr Asn Asn Asn Thr Leu 
I 520 1525 1530 
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acc eta cgt cat 
Thr Leu Arg His 
1535 



tct ccc ccc tta aac eta act gac aat age tta 
Ser Pro Pro Leu Asn Leu Thr Asp Asn Ser Leu 
1540 1545 



28394 



gtg eta ggc tac teg agt cct etc cgc gtc aca gac aac aaa ctt 
Val Leu Gly Tyr Ser Ser Pro Leu Arg Val Thr Asp Asn Lys Leu 
1550 1555 1560 



28439 



aca ttt aac ttc aca tea cca etc cgt tat gaa aat gaa aac ctt 
Thr Phe Asn Phe Thr Ser Pro Leu Arg Tyr Glu Asn Glu Asn Leu 
1565 1570 " 1575 



28484 



act ttt aac tat aca gag cct ctt aaa ctt ata aat aac age ctt 
Thr Phe Asn Tyr Thr Glu Pro Leu Lys Leu lie Asn Asn Ser Leu 
1580 1585 1590 



28529 



gee att gac ate aat tec tea aaa ggc ctt agt age gtc gga ggc 
Ala lie Asp lie Asn Ser Ser Lys Gly Leu Ser Ser Val Gly Gly 
1595 1600 1605 



28574 



tea eta get gta aac ctg agt tea gac tta aag ttt gac age aac 
Ser Leu Ala Val Asn Leu Ser Ser Asp Leu Lys Phe Asp Ser Asn 
1610 1615 1620 



28619 



gga tec ata get ttt ggc ata caa acc ctg tgg acc get ccg acc 
Gly Ser lie Ala Phe Gly lie Gin Thr Leu Trp Thr Ala Pro Thr 
1625 1630 1635 



28664 



teg act ggc aac tgc acc gtc tac age 
Ser Thr Gly Asn Cys Thr Val Tyr Ser 
1640 1645 



gag ggc gat tec eta ctt 
Glu Gly Asp Ser Leu Leu 
1650 



28709 



agt etc tgt tta acc aaa tgc gga get cac gtc tta gga agt gta 
Ser Leu Cys Leu Thr Lys Cys Gly Ala His Val Leu Gly Ser Val 
1655 1660 1665 



28754 



agt tta acc ggt tta aca gga acc ata acc caa atg act gat att 28799 
Ser Leu Thr Gly Leu Thr Gly Thr He Thr Gin Met Thr Asp He 
1670 1675 1680 

tct gtc acc att caa ttt aca ttt gac aac aat ggt aag eta eta 28844 
Ser Val Thr He Gin Phe Thr Phe Asp Asn Asn Gly Lys Leu Leu 
1685 1690 ~ 1695 



age tct cca ctt 
Ser Ser Pro Leu 
1700 



ata aac aac gec ttt agt att cga cag aat gac 
He Asn Asn Ala Phe Ser He Arg Gin Asn Asp 
1705 1710 



28889 



agt acg gee tea aac cct acc tac aac gec ctg gcg ttt atg cct 
Ser Thr Ala Ser Asn Pro Thr Tyr Asn Ala Leu Ala Phe Met Pro 
1715 1720 1725 



28934 



aac agt acc ata tat gca aga ggg gga 
Asn Ser Thr He Tyr Ala Arg Gly Gly 
1730 - 1735 



ggt ggt gaa cca cga aac 
Gly Gly Glu Pro Arg Asn 
1740 



28979 
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aac tac tac gtc 
Asn Tyr Tyr Val 
1745 



caa acg tat ctt agg gga aat gtt caa aaa cca 
Gin Thr Tyr Leu Arg Gly Asn Val Gin Lys Pro 
1750 1755 



ate att ctt act gta acc tac aac tea gtc gee aca gga tat tec 
He He Leu Thr Val Thr Tyr Asn Ser Val Ala Thr Gly Tyr Ser 
1760 1765 1770 

tta tct ttt aag tgg act get ctt gca cgt gaa aag ttt gca acc 
Leu Ser Phe Lys Trp Thr Ala Leu Ala Arg Glu Lys Phe Ala Thr 
1775 1780 1785 

cca aca acc teg ttt tgc tac att aca gaa caa taa aaccgtgtac 
Pro Thr Thr Ser Phe Cys Tyr He Thr Glu Gin 
1790 1795 

cccaccgttt cgtttttttc ag atg aaa egg gcg aga gtt gat gaa gac 

Met Lys Arg Ala Arg Val Asp Glu Asp 
1800 1805 

ttc aac cca gtg tac cct tat gac ccc cca cat get cct gtt atg 
Phe Asn Pro Val Tyr Pro Tyr Asp Pro Pro His Ala Pro Val Met 
1810 1815 1820 

ccc ttc att act cca cct ttt acc tec teg gat ggg ttg cag gaa 
Pro Phe He Thr Pro Pro Phe Thr Ser Ser Asp Gly Leu Gin Glu 
1825 1830 " 1835 

aaa cca ctt gga gtg tta agt tta aac tac aga gat ccc att act 
Lys Pro Leu Gly Val Leu Ser Leu Asn Tyr Arg Asp Pro He Thr 
1840 1845 ~ * 1850 



acg caa aat gag 
Thr Gin Asn Glu 



tct ctt aca att aaa eta gga aac ggc etc act 
Ser Leu Thr He Lys Leu Gly Asn Gly Leu Thr 
I 855 I860 1 1865 



eta gac aac cag gga caa eta aca tea acc get ggc gaa gta gaa 

Leu Asp Asn Gin Gly Gin Leu Thr Ser Thr Ala Gly Glu Val Glu 
1870 1875 " 1880 

cct cca etc act aac get aac aac aaa ctt gca ctg gtc tat age 

Pro Pro Leu Thr Asn Ala Asn Asn Lys Leu Ala Leu Val Tyr Ser 
1885 1890 1895 

gat cct tta gca gta aag cgc aac age eta acc tta teg cac acc 

Asp Pro Leu Ala Val Lys Arg Asn Ser Leu Thr Leu Ser His Thr 
1900 1905 1910 

get ccc ctt gtt att get gat aac tct tta gca ttg caa gtt tea 

Ala Pro Leu Val He Ala Asp Asn Ser Leu Ala Leu Gin Val Ser 
1915 1920 1925 



gag cct att ttt ata aat gac aag gac 
Glu Pro He Phe He Asn Asp Lys Asp 
1930 1935 



aaa eta gee ctg caa aca 
Lys Leu Ala Leu Gin Thr 
1940 



29024 



29069 



29114 



29160 



29209 



29254 



29299 



29344 



29389 



29434 



29479 



29524 



29569 



29614 
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gcc gcg ccc ctt 
Ala Ala Pro Leu 
1945 

gcc gcc cct tta 
Ala Ala Pro Leu 
1960 

acc aac cct ttg 
Thr Asn Pro Leu 
1975 

gaa cga ccc ctt 
Glu Arg Pro Leu 
1990 

etc tec cca ccg 
Leu Ser Pro Pro 
2005 

acc aac gtg cca 
Thr Asn Val Pro 
2020 

ata aag cgc cca 
lie Lys Arg Pro 
2035 

ttc aga get ccc 
Phe Arg Ala Pro 
2050 

ctt aac ttt tac 
Leu Asn Phe Tyr 
2065 

gtt aac aca ggc 
Val Asn Thr Gly 
2080 

tta aat ctt ggt 
Leu Asn Leu Gly 
2095 

gtc get ctt agt 
Val Ala Leu Ser 
2110 

cgc ctt aac gtg 
Arg Leu Asn Val 
2125 

aaa ttg gac gtg 
Lys Leu Asp Val 
2140 



gta act aac get ggc 
Val Thr Asn Ala Gly 
1950 

ggc att gca gac caa 
Gly lie Ala Asp Gin 
1965 

tac ttg cag aat aac 
Tyr Leu Gin Asn Asn 
1980 

gcc att acc aat act 
Ala lie Thr Asn Thr 
1995 

eta caa aca gca gac 
Leu Gin Thr Ala Asp 
2010 

tta act gta age aac 
Leu Thr Val Ser Asn 
2025 

ctt att att cag gac 
Leu lie lie Gin Asp 
2040 

ctg cgt ctt ttc aac 
Leu Arg Leu Phe Asn 
2055 

acc cct ctt gcg gta 
Thr Pro Leu Ala Val 
2070 

cgc ggc etc aca gtg 
Arg Gly Leu Thr Val 
2085 

aag gat ctt cgc ttt 
Lys Asp Leu Arg Phe 
2100 

get get ttg cct tta 
Ala Ala Leu Pro Leu 
2115 

ggc get ggg ctg cgt 
Gly Ala Gly Leu Arg 
2130 

aac ccc aat caa aac 
Asn Pro Asn Gin Asn 
2145 



acc ctt cgc tta 
Thr Leu Arg Leu 



acc eta aaa etc 
Thr Leu Lys Leu 



ttt etc acg tta 
Phe Leu Thr Leu 



gga aag ctg get 
Gly Lys Leu Ala 



aca ggc ttg act 
Thr Gly Leu Thr 



ggg acc eta ggc 
Gly Thr Leu Gly 



aac aac ttg ttt 
Asn Asn Leu Phe 



age gac cca gta 
Ser Asp Pro Val 



cgc gat gag gcg 
Arg Asp Glu Ala 



agt tac gat ggt 
Ser Tyr Asp Gly 



gac aac aac acc 
Asp Asn Asn Thr 



caa tac act gat 
Gin Tyr Thr Asp 



tac aat cca gtg 
Tyr Asn Pro Val 



aag ggt tta acc 
Lys Gly Leu Thr 



caa age 29659 

Gin Ser 

1955 

ctg ttt 29704 

Leu Phe 

1970 

gcc att 29749 

Ala He 

1985 

eta cag 29794 

Leu Gin 

2000 

ttg caa 29839 

Leu Gin 

2015 

tta gcc 29884 

Leu Ala 

2030 

ttg gac 29929 

Leu Asp 

2045 

eta ggg 29974 

Leu Gly 

2060 

etc act 30019 

Leu Thr 

2075 

tta att 30064 

Leu He 

2090 

gtt tct 30109 
Val Ser 
2105 ' 

cag ctt 30154 

Gin Leu 

2120 

agt aag 30199 

Ser Lys 

2135 

tgg gaa 30244 

Trp Glu 

2150 
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aat gac tac etc att gta aag eta gga aat gga tta ggt ttt gat 
Asn Asp Tyr Leu lie Val Lys Leu Gly Asn Gly Leu Gly Phe Asp 
2155 2160 2165 



30289 



ggc gat gga aac ata get gtt tct cct caa gtt aca teg cct gac 
Gly Asp Gly Asn lie Ala Val Ser Pro Gin Val Thr Ser Pro Asp 
2170 2175 2180 



30334 



ace tta tgg ace act gec gac cca tec ccc aat tgt tec ate tac 
Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys Ser lie Tyr 
2185 2190 2195 



30379 



act gat tta gat gee aaa atg tgg etc teg ttg gta aaa caa ggg 
Thr Asp Leu Asp Ala Lys Met Trp Leu Ser Leu Val Lys Gin Gly 
2200 2205 2210 



30424 



ggt gtg gtt cac ggt tct gtt get tta aaa gca ttg aaa gga ace 
Gly Val Val His Gly Ser Val Ala Leu Lys Ala Leu Lys Gly Thr 
2215 2220 2225 



30469 



eta ttg agt cct acg gaa age gee att gtt att ata eta cat ttt 
Leu Leu Ser Pro Thr Glu Ser Ala lie Val lie lie Leu His Phe 
2230 2235 2240 



30514 



gac aat tat gga gtg cga att etc aat tat ccc act ttg ggc act 
Asp Asn Tyr Gly Val Arg lie Leu Asn Tyr Pro Thr Leu Gly Thr 
2245 2250 2255 



30559 



caa ggc acg ttg gga aat aat gca act tgg ggt tat agg cag gga 
Gin Gly Thr Leu Gly Asn Asn Ala Thr Trp Gly Tyr Arg Gin Gly 
2260 2265 2270 



30604 



gaa tct gca gac act aat gta etc aat gca eta gca ttt atg ccc 
Glu Ser Ala Asp Thr Asn Val Leu Asn Ala Leu Ala Phe Met Pro 
2275 2280 2285 



30649 



agt tea aaa agg tac cca aga ggg cgt gga age gaa gtt cag aat 
Ser Ser Lys Arg Tyr Pro Arg Gly Arg Gly Ser Glu Val Gin Asn 
2290 2295 2300 



30694 



caa act gtg ggc tac act tgt ata cag ggt gac ttt tct atg ccc 
Gin Thr Val Gly Tyr Thr Cys He Gin Gly Asp Phe Ser Met Pro 
2305 2310 2315 



30739 



gta ccg tac caa ata cag tac aac tat gga cca act ggc tac tec 
Val Pro Tyr Gin He Gin Tyr Asn Tyr Gly Pro Thr Gly Tyr Ser 
2320 2325 2330 



30784 



ttt aaa ttt att 
Phe Lys Phe He 
2335 



tgg aga act gtt tea aga caa cca ttt gac ate 
Trp Arg Thr Val Ser Arg Gin Pro Phe Asp He 
2340 2345 



30829 



cca tgc tgt ttt ttc tct tac att acg gaa gaa taa aacaactttt 
Pro Cys Cys Phe Phe Ser Tyr He Thr Glu Glu 
2350 2355 



30875 



tctttttatt ttctttttat tttacacgea cagtaaggct tcctccaccc ttccatctca 30935 
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cagcatacac cagcctctcc cccttcatgg cagtaaactg ttgtgagtca gtccggtatt 30995 

tgggagttaa gatccaaaca gtctctttgg tgatgaaaca tggatccgtg atggacacaa 31055 

atccctggga caggttctcc aacgtttcgg taaaaaactg catgccgccc tacaaaacaa 31115 

acaggttcag gctctccacg ggttatctcc ccgatcaaac tcagacagag taaaggtgcg 31175 

atgatgttcc actaaaccac gcaggtggcg ctgtctgaac ctctcggtgc gactcctgtg 31235 

aggctggtaa gaagttagat tgtccagcag cctcacagca tggatcatca gtctacgagt 31295 

gcgtctggcg cagcagcgca tctgaatctc actgagattc cggcaagaat cgcacaccat 31355 

cacaatcagg ttgttcatga tcccatagct gaacacgctc cagccaaagc tcattcgctc 31415 

caacagcgcc accgcgtgtc cgtccaacct tactttaaca taaatcaggt gtctgccgcg 31475 

tacaaacatg ctacccgcat acagaacctc ccggggcaaa cccctgttca ccacctgcct 31535 

gtaccaggga aacctcacat ttatcaggga gccatagata gccattttaa accaattagc 31595 

taacaccgcc ccaccagctc tacactgaag agaaccggga gagttacaat gacagtgaat 31655 

aatccatctc tcataacccc taatggtctg atggaaatcc agatctaacg tggcacagca 31715 

gatacacact ttcatataca ttttcatcac atgtttttcc caggccgtta aaatacaatc 31775 

ccaatacacg ggccactcct gcagtacaat aaagctaata caagatggta tactcctcac 31835 

ctcactaaca ttgtgcatgt tcatattttc acattctaag taccgagagt tctcctctac 31895 

aacagcactg ccgcggtcct cacaaggtgg tagctggtga cgattgtaag gagccagtct 31955 

gcagcgatac cgtctgtcgc gttgcatcgt agaccaggga ccgacgcact tcctcgtact 32015 

tgtagtagca gaaccacgtc cgctgccagc acgtctccaa gtaacgccgg tccctgcgtc 32075 

gctcacgctc cctcctcaac gcaaagtgca accactcttg taatccacac agatccctct 32135 

cggcctccgg ggcgatgcac acctcaaacc tacagatgtc tcggtacagt tccaaacacg 32195 

tagtgagggc gagttccaac caagacagac agcctgatct atcccgacac actggaggtg 32255 

gaggaagaca cggaagaggc atgttattcc aagcgattca ccaacgggtc gaaatgaaga 32315 

tcccgaagat gacaacggtc gcctccggag ccctgatgga atttaacagc cagatcaaac 32375 

attatgcgat tttccaggct atcaatcgcg gcctccaaaa gagcctggac ccgcacttcc 32435 

acaaacacca gcaaagcaaa agcgttatta tcaaactctt cgatcatcaa gctgcaggac 32495 

tgtacaatgc ccaagtaatt ttcatttctc cactcgcgaa tgatgtcgcg gcaaatagtc 32555 

tgaaggttca tgccgtgcat attaaaaagc tccgaaaggg cgccctctat agccatgcgt 32615 
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agacacacca tcatgactgc aagatatcgg gctcctgaga cacctgcagc agatttaaca 32675 

gacccaggtc aggttgctct ccgcgatcgc gaatctccat ccgcaaagtc atttgcaaat 32735 

aattaaatag atctgcgccg actaaatctg ttaactccgc gctaggaact aaatcaggtg 32795 

tggctacgca gcacaaaagt tccagggatg gcgccaaact cactagaacc gctcccgagt 32855 

agcaaaactg atgaatggga gtaacacagt gtaaaatgtt cagccaaaaa tcactaagct 32915 

gctcctttaa aaagtccagt acttctatat tcagttcgtg caagtactga agcaactgtg 32975 

cgggaatatg cacagcaaaa aaaatagggc ggctcagata catgttgacc taaaataaaa 33035 

agaatcatta aactaaagaa gcctggcgaa cggtgggata tatgacacgc tccagcagca 33095 

ggcaagcaac cggctgtccc cgggaaccgc ggtaaaattc atccgaatga ttaaaaagaa 33155 

caacagagac ttcccaccat gtactcggtt ggatctcctg agcacagagc aatacccccc 33215 

tcacattcat atccgctaca gaaaaaaaac gtcccagata cccagcggga atatccaacg 33275 

acagctgcaa agacagcaaa acaatccctc tgggagcaat cacaaaatcc tccggtgaaa 33335 

aaagcacata catattagaa taaccctgtt gctggggcaa aaaggcccgt cgtcccagca 33395 

aatgcacata aatatgttca tcagccattg ccccgtctta ccgcgtaaac agccacgaaa 33455 

aaatcgagct aaaatccacc caacagccta tagctatata tacactccac ccaatgacgc 33515 

taataccgca ccacccacga ccaaagttca cccacaccca caaaacccgc gaaaatccag 33575 

cgccgtcagc acttccgcaa tttcagtctc acaacgtcac ttccgcgcgc cttttcactt 33635 

tcccacacac gcccttcgcc cgcccgccct cgcgccaccc cgcgtcaccc cacgtcaccg 33695 

cacgtcaccc cggccccgcc tcgctcctcc ccgctcatta tcatattggc acgtttccag 33755 

aataaggtat attattgatg cagcaaaaca atccctctgg gagcaatcac aaaatcctcc 33815 

ggtgaaaaaa gcacatacat attagaataa ccctgttgct ggggcaaaaa ggcccgtcgt 33875 

cccagcaaat gcacataaat atgttcatca gccattgccc cgtcttaccg cgtaaacagc 33935 

cacgaaaaaa tcgagctaaa atccacccaa cagcctatag ctatatatac actccaccca 33995 

atgacgctaa taccgcacca cccacgacca aagttcaccc acacccacaa aacccgcgaa 34055 

aatccagcgc cgtcagcact tccgcaattt cagtctcaca acgtcacttc cgcgcgcctt 34115 

ttcactttcc cacacacgcc cttcgcccgc ccgccctcgc gccaccccgc gtcaccccac 34175 

gtcaccgcac gtcaccccgg ccccgcctcg ctcctccccg ctcattatca tattggcacg 34235 

tttccagaat aaggtatatt attgatgca 34264 
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<210> 25 

<211> 503 

<212> PRT 

<213> simian adenovirus SV-1 

<400> 25 

Met Arg Arg Ala Val Arg Val Thr Pro Ala Ala Tyr Glu Gly Pro Pro 
1 5 10 15 

Pro Ser Tyr Glu Ser Val Met Gly Ser Ala Asn Val Pro Ala Thr Leu 
20 25 30 

Glu Ala Pro Tyr Val Pro Pro Arg Tyr Leu Gly Pro Thr Glu Gly Arg 
35 40 45 

Asn Ser lie Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Lys 
50 55 60 

Val Tyr Leu Val Asp Asn Lys Ser Ala Asp lie Ala Ser Leu Asn Tyr 
65 70 75 80 

Gin Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gin Asn Asn Asp 
85 90 95 

Phe Thr Pro Thr Glu Ala Gly Thr Gin Thr He Asn Phe Asp Glu Arg 
100 105 110 

Ser Arg Trp Gly Gly Gin Leu Lys Thr He Leu His Thr Asn Met Pro 
115 120 125 

Asn He Asn Glu Phe Met Ser Thr Asn Lys Phe Arg Ala Arg Leu Met 
130 135 * 140 

Val Lys Lys Ala Glu Asn Gin Pro Pro Glu Tyr Glu Trp Phe Glu Phe 
145 150 155 * 160 

Thr He Pro Glu Gly Asn Tyr Ser Glu Thr Met Thr He Asp Leu Met 
165 170 175 

Asn Asn Ala He Val Asp Asn Tyr Leu Gin Val Gly Arg Gin Asn Gly 
180 185 190 

Val Leu Glu Ser Asp He Gly Val Lys Phe Asp Thr Arg Asn Phe Arg 
195 200 205 

Leu Gly Trp Asp Pro Val Thr Lys Leu Val Met Pro Gly Val Tyr Thr 
210 215 220 

Asn Glu Ala Phe His Pro Asp He Val Leu Leu Pro Gly Cys Gly Val 
225 230 235 240 

Asp Phe Thr Gin Ser Arg Leu Ser Asn Leu Leu Gly He Arg Lys Arg 
245 250 * 255 

Arg Pro Phe Gin Glu Gly Phe Gin He Met Tyr Glu Asp Leu Glu Gly 
260 265 ^ 270 
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Gly Asn lie Pro Gly Leu Leu Asp Val Pro Ala Tyr Glu Glu Ser Val 
275 280 285 

Lys Gin Ala Glu Ala Gin Gly Arg Glu lie Arg Gly Asp Thr Phe Ala 
290 295 300 

Thr Glu Pro His Glu Leu Val lie Lys Pro Leu Glu Gin Asp Ser Lys 
305 310 315 320 

Lys Arg Ser Tyr Asn lie lie Ser Gly Thr Met Asn Thr Leu Tyr Arg 
325 330 335 

Ser Trp Phe Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg 
340 345 350 

Ser Trp Thr lie Leu Thr Thr Thr Asp Val Thr Cys Gly Ser Gin Gin 
355 360 365 

Val Tyr Trp Ser Leu Pro Asp Met Met Gin Asp Pro Val Thr Phe Arg 
370 375 380 

Pro Ser Thr Gin Val Ser Asn Phe Pro Val Val Gly Thr Glu Leu Leu 
385 390 395 400 

Pro Val His Ala Lys Ser Phe Tyr Asn Glu Gin Ala Val Tyr Ser Gin 
405 410 415 

Leu He Arg Gin Ser Thr Ala Leu Thr His Val Phe Asn Arg Phe Pro 
420 425 430 

Glu Asn Gin He Leu Val Arg Pro Pro Ala Pro Thr He Thr Thr Val 
435 440 445 

Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg 
450 455 ~ 460 

Ser Ser He Ser Gly Val Gin Arg Val Thr He Thr Asp Ala Arg Arg 
465 470 475 480 

Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Val Val Ala Pro Lys 
485 490 495 



Val Leu Ser Ser Arg Thr Phe 
500 



<210> 26 

<211> 931 

<212> PRT 

<213> simian adenovirus SV-1 



<400> 26 

Met Ala Thr Pro Ser Met Met Pro 
1 5 



Gin Trp Ser Tyr Met His He Ala 
10 " 15 
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Gly Gin Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gin Phe Ala 
20 ~ 25 30 

Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro 
35 40 " 45 

Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gin Arg Leu 
50 55 60 

Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Thr Ala Tyr Ser Tyr 
65 70 75 ~ 80 

Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met 
85 90 95 

Ala Ser Thr Tyr Phe Asp He Arg Gly Val Leu Asp Arg Gly Pro Ser 
100 105 110 

Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly 
115 120 125 

Ala Pro Asn Pro Ala Glu Trp Thr Asn Ser Asp Ser Lys Val Lys Val 
130 135 140 

Arg Ala Gin Ala Pro Phe Val Ser Ser Tyr Gly Ala Thr Ala He Thr 
145 150 155 160 

Lys Glu Gly He Gin Val Gly Val Thr Leu Thr Asp Ser Gly Ser Thr 
165 170 175 

Pro Gin Tyr Ala Asp Lys Thr Tyr Gin Pro Glu Pro Gin He Gly Glu 
180 185 190 

Leu Gin Trp Asn Ser Asp Val Gly Thr Asp Asp Lys He Ala Gly Arg 
195 200 205 

Val Leu Lys Lys Thr Thr Pro Met Phe Pro Cys Tyr Gly Ser Tyr Ala 
210 215 220 

Arg Pro Thr Asn Glu Lys Gly Gly Gin Ala Thr Pro Ser Ala Ser Gin 
225 230 235 240 

Asp Val Gin Asn Pro Glu Leu Gin Phe Phe Ala Ser Thr Asn Val Ala 
245 250 255 

Asn Thr Pro Lys Ala Val Leu Tyr Ala Glu Asp Val Ser He Glu Ala 
260 . 265 270 

Pro Asp Thr His Leu Val Phe Lys Pro Thr Val Thr Glu Gly He Thr 
275 280 285 

Ser Ser Glu Ala Leu Leu Thr Gin Gin Ala Ala Pro Asn Arg Pro Asn 
290 295 300 



Tyr He Ala Phe Arg Asp Asn Phe He Gly Leu Met Tyr Tyr Asn Ser 
305 310 315 ™ 320 
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Thr Gly Asn Met Gly Val Leu Ala Gly Gin Ala Ser Gin Leu Asn Ala 
325 330 335 

Val Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin Leu Met 
340 " 345 350 

Leu Asp Ala Leu Gly Asp Arg Ser Arg Tyr Phe Ser Met Trp Asn Gin 
355 "* 360 365 

Ala Val Asp Ser Tyr Asp Pro Asp Val Arg lie lie Glu Asn His Gly 
370 * " 375 380 

Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Gly Gly Met Ala 
385 390 395 400 

Val Thr Asp Thr Tyr Ser Pro lie Lys Val Asn Gly Gly Gly Asn Gly 
405 410 415 

Trp Glu Ala Asn Asn Gly Val Phe Thr Glu Arg Gly Val Glu lie Gly 
420 425 430 

Ser Gly Asn Met Phe Ala Met Glu lie Asn Leu Gin Ala Asn Leu Trp 
435 440 445 

Arg Ser Phe Leu Tyr Ser Asn lie Gly Leu Tyr Leu Pro Asp Ser Leu 
450 455 460 

Lys lie Thr Pro Asp Asn He Thr Leu Pro Glu Asn Lys Asn Thr Tyr 
465 470 475 480 

Gin Tyr Met Asn Gly Arg Val Thr Pro Pro Gly Leu Val Asp Thr Tyr 
485 490 495 

Val Asn Val Gly Ala Arg Trp Ser Pro Asp Val Met Asp Ser He Asn 
500 505 510 

Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu 
515 520 525 

Leu Gly Asn Gly Arg Tyr Val Pro Phe His He Gin Val Pro Gin Lys 
530 535 540 

Phe Phe Ala He Lys Asn Leu Leu Leu Leu Pro Gly Ser Tyr Thr Tyr 
545 550 555 560 

Glu Trp Asn Phe Arg Lys Asp Val Asn Met He Leu Gin Ser Ser Leu 
565 570 575 

Gly Asn Asp Leu Arg Val Asp Gly Ala Ser He Arg Phe Asp Ser He 
580 585 590 

Asn Leu Tyr Ala Asn Phe Phe Pro Met Ala His Asn Thr Ala Ser Thr 
595 600 605 



Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gin Ser Phe Asn Asp 
610 615 620 
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Tyr Leu Cys Ala Ala Asn Met 
625 630 



Leu Tyr Pro lie Pro Ala Asn Ala Thr 
635 640 



Ser Val Pro He Ser He Pro 
645 



Ser Arg Asn Trp Ala Ala Phe Arg Gly 
650 655 



Trp Ser Phe Thr Arg Leu Lys 
660 



Thr Lys Glu Thr Pro Ser Leu Gly Ser 
665 670 



Gly Phe Asp Pro Tyr Phe Val 
675 



Tyr Ser Gly Ser He Pro Tyr Leu Asp 
680 685 



Gly Thr Phe Tyr Leu Asn His 
690 695 



Thr Phe Lys Lys Val Ser He Met Phe 
700 



Asp Ser Ser Val Ser Trp Pro 
705 710 



Gly Asn Asp Arg Leu Leu Thr Pro Asn 
715 720 



Glu Phe Glu He Lys Arg Ser 
725 



Val Asp Gly Glu Gly Tyr Asn Val Ala 
730 735 



Gin Ser Asn Met Thr Lys Asp 
740 



Trp Phe Leu He Gin Met Leu Ser His 
745 750 



Tyr Asn He Gly Tyr Gin Gly 
755 



Phe Tyr Val Pro Glu Asn Tyr Lys Asp 
760 ~ 765 



Arg Met Tyr Ser Phe Phe Arg 
770 775 



Asn Phe Gin Pro Met Ser Arg Gin He 
780 



Val Asp Ser Thr Ala Tyr Thr 
785 790 



Asn Tyr Gin Asp Val Lys Leu Pro Tyr 
795 800 



Gin His Asn Asn Ser Gly Phe 
805 



Val Gly Tyr Met Gly Pro Thr Met Arg 
810 815 



Glu Gly Gin Ala Tyr Pro Ala 
820 



Asn Tyr Pro Tyr Pro Leu He Gly Ala 
825 830 



Thr Ala Val Pro Ser Leu Thr 
835 



Gin Lys Lys Phe Leu Cys Asp Arg Val 
840 ~ 845 



Met Trp Arg He Pro Phe Ser 
850 ' 855 



Ser Asn Phe Met Ser Met Gly Ser Leu 
860 



Thr Asp Leu Gly Gin Asn Met 
865 870 



Leu Tyr Ala Asn Ser Ala His Ala Leu 
875 880 



Asp Met Thr Phe Glu Val Asp 
885 



Pro Met Asp Glu Pro Thr Leu Leu Tyr 
890 895 



Val Leu Phe Glu Val Phe Asp 
900 



Val Val Arg lie His Gin Pro His Arg 
905 910 
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Gly Val He Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly Asn 
915 920 925 

Ala Thr Thr 
930 



<210> 27 
<211> 363 
<212> PRT 

<213> simian adenovirus SV-1 
<400> 27 

Met Lys Arg Thr Arg Val Asp Glu Asp Phe Asn Pro Val Tyr Pro Tyr 
15 10 15 

Asp Thr Thr Thr Thr Pro Ala Val Pro Phe He Ser Pro Pro Phe Val 
20 25 30 

Asn Ser Asp Gly Leu Gin Glu Asn Pro Pro Gly Val Leu Ser Leu Arg 
35 40 45 

lie Ala Lys Pro Leu Tyr Phe Asp Met Glu Arg Lys Leu Ala Leu Ser 
50 55 60 

Leu Gly Arg Gly Leu Thr He Thr Ala Ala Gly Gin Leu Glu Ser Thr 
65 70 75 80 

Gin Ser Val Gin Thr Asn Pro Pro Leu He He Thr Asn Asn Asn Thr 
85 90 95 

Leu Thr Leu Arg His Ser Pro Pro Leu Asn Leu Thr Asp Asn Ser Leu 
100 105 110 

Val Leu Gly Tyr Ser Ser Pro Leu Arg Val Thr Asp Asn Lys Leu Thr 
115 120 125 

Phe Asn Phe Thr Ser Pro Leu Arg Tyr Glu Asn Glu Asn Leu Thr Phe 
130 135 140 

Asn Tyr Thr Glu Pro Leu Lys Leu He Asn Asn Ser Leu Ala He Asp 
145 150 155 160 

He Asn Ser Ser Lys Gly Leu Ser Ser Val Gly Gly Ser Leu Ala Val 
165 170 "* "* 175 

Asn Leu Ser Ser Asp Leu Lys Phe Asp Ser Asn Gly Ser He Ala Phe 
180 185 190 

Gly He Gin Thr Leu Trp Thr Ala Pro Thr Ser Thr Gly Asn Cys Thr 
195 200 205 

Val Tyr Ser Glu Gly Asp Ser Leu Leu Ser Leu Cys Leu Thr Lys Cys 
210 215 220 



147 



WO 03/046124 PCT/US02/33645 



Gly Ala His Val Leu Gly Ser Val Ser Leu Thr Gly Leu Thr Gly Thr 
225 230 235 240 

He Thr Gin Met Thr Asp He Ser Val Thr He Gin Phe Thr Phe Asp 
245 250 255 

Asn Asn Gly Lys Leu Leu Ser Ser Pro Leu He Asn Asn Ala Phe Ser 
260 265 270 

He Arg Gin Asn Asp Ser Thr Ala Ser Asn Pro Thr Tyr Asn Ala Leu 
275 280 285 

Ala Phe Met Pro Asn Ser Thr He Tyr Ala Arg Gly Gly Gly Gly Glu 
290 295 300 

Pro Arg Asn Asn Tyr Tyr Val Gin Thr Tyr Leu Arg Gly Asn Val Gin 
305 310 315 320 

Lys Pro He He Leu Thr Val Thr Tyr Asn Ser Val Ala Thr Gly Tyr 
325 330 335 

Ser Leu Ser Phe Lys Trp Thr Ala Leu Ala Arg Glu Lys Phe Ala Thr 
340 345 350 

Pro Thr Thr Ser Phe Cys Tyr He Thr Glu Gin 
355 360 



<210> 28 
<211> 560 
<212> PRT 

<213> simian adenovirus SV-1 
<400> 28 

Met Lys Arg Ala Arg Val Asp Glu Asp Phe Asn Pro Val Tyr Pro Tyr 
15 10 15 

Asp Pro Pro His Ala Pro Val Met Pro Phe He Thr Pro Pro Phe Thr 
20 25 30 

Ser Ser Asp Gly Leu Gin Glu Lys Pro Leu Gly Val Leu Ser Leu Asn 
35 40 45 

Tyr Arg Asp Pro He Thr Thr Gin Asn Glu Ser Leu Thr He Lys Leu 
50 55 60 

Gly Asn Gly Leu Thr Leu Asp Asn Gin Gly Gin Leu Thr Ser Thr Ala 
65 ~ 70 75 80 

Gly Glu Val Glu Pro Pro Leu Thr Asn Ala Asn Asn Lys Leu Ala Leu 
85 90 95 

Val Tyr Ser Asp Pro Leu Ala Val Lys Arg Asn Ser Leu Thr Leu Ser 
100 105 110 
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His Thr Ala Pro Leu Val lie Ala Asp Asn Ser Leu Ala Leu Gin Val 
115 120 125 

Ser Glu Pro lie Phe lie Asn Asp Lys Asp Lys Leu Ala Leu Gin Thr 
130 135 140 

Ala Ala Pro Leu Val Thr Asn Ala Gly Thr Leu Arg Leu Gin Ser Ala 
145 150 155 160 

Ala Pro Leu Gly lie Ala Asp Gin Thr Leu Lys Leu Leu Phe Thr Asn 
165 170 175 

Pro Leu Tyr Leu Gin Asn Asn Phe Leu Thr Leu Ala lie Glu Arg Pro 
180 185 190 

Leu Ala lie Thr Asn Thr Gly Lys Leu Ala Leu Gin Leu Ser Pro Pro 
195 200 205 

Leu Gin Thr Ala Asp Thr Gly Leu Thr Leu Gin Thr Asn Val Pro Leu 
210 215 220 

Thr Val Ser Asn Gly Thr Leu Gly Leu Ala lie Lys Arg Pro Leu lie 
225 ~ 230 235 240 

lie Gin Asp Asn Asn Leu Phe Leu Asp Phe Arg Ala Pro Leu Arg Leu 
245 250 255 

Phe Asn Ser Asp Pro Val Leu Gly Leu Asn Phe Tyr Thr Pro Leu Ala 
260 265 270 

Val Arg Asp Glu Ala Leu Thr Val Asn Thr Gly Arg Gly Leu Thr Val 
275 280 285 

Ser Tyr Asp Gly Leu lie Leu Asn Leu Gly Lys Asp Leu Arg Phe Asp 
290 295 300 

Asn Asn Thr Val Ser Val Ala Leu Ser Ala Ala Leu Pro Leu Gin Tyr 
305 310 315 320 

Thr Asp Gin Leu Arg Leu Asn Val Gly Ala Gly Leu Arg Tyr Asn Pro 
325 330 335 

Val Ser Lys Lys Leu Asp Val Asn Pro Asn Gin Asn Lys Gly Leu Thr 
340 345 350 

Trp Glu Asn Asp Tyr Leu lie Val Lys Leu Gly Asn Gly Leu Gly Phe 
355 " 360 365 

Asp Gly Asp Gly Asn lie Ala Val Ser Pro Gin Val Thr Ser Pro Asp 
370 375 380 

Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys Ser lie Tyr Thr 
385 390 395 400 



Asp Leu Asp Ala Lys Met Trp Leu Ser Leu Val Lys Gin Gly Gly Val 
405 410 415 
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Val His Gly Ser Val Ala Leu Lys Ala Leu Lys Gly Thr Leu Leu Ser 
420 425 430 

Pro Thr Glu Ser Ala He Val He He Leu His Phe Asp Asn Tyr Gly 
435 440 445 

Val Arg He Leu Asn Tyr Pro Thr Leu Gly Thr Gin Gly Thr Leu Gly 
450 455 460 

Asn Asn Ala Thr Trp Gly Tyr Arg Gin Gly Glu Ser Ala Asp Thr Asn 
465 470 475 480 

Val Leu Asn Ala Leu Ala Phe Met Pro Ser Ser Lys Arg Tyr Pro Arg 
485 490 495 

Gly Arg Gly Ser Glu Val Gin Asn Gin Thr Val Gly Tyr Thr Cys He 
500 505 510 

Gin Gly Asp Phe Ser Met Pro Val Pro Tyr Gin He Gin Tyr Asn Tyr 
515 520 525 

Gly Pro Thr Gly Tyr Ser Phe Lys Phe He Trp Arg Thr Val Ser Arg 
530 535 540 

Gin Pro Phe Asp He Pro Cys Cys Phe Phe Ser Tyr He Thr Glu Glu 
545 550 555 560 



<210> 29 

<211> 31044 

<212> DNA 

<213> simian adenovirus SV-25 
<220> 

<221> CDS 

<222> (12284) . . (13801) 

<223> Penton 



<220> 

<221> CDS 

<222> (16681) 

<223> Hexon 



(19446) 



<220> 

<221> CDS 

<222> (25380) . . (26423) 

<223> Fiber #2 

<220> 

<221> CDS 

<222> (26457) . . (28136) 

<223> Fiber #1 



<400> 29 

catcatcaat aatatacctt attctggaaa cgtgccaata tgataatgag cggggaggag 



60 
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cgaggcgggg 


ccggggtgac 


gtgcggtgac 


gcggggtggc 


gcgagggcgg 


ggcgaagggc 


120 


gcgggtgtgt 


gtgtgggagg 


cgcttagttt 


ttacgtatgc 


ggaaggaggt 


tttataccgg 


180 


aagatgggta 


atttgggcgt 


atacttgtaa 


gttttgtgta 


atttggcgcg 


aaaactgggt 


240 


aatgaggaag 


ttgaggttaa 


tatgtacttt 


ttatgactgg 


gcggaatttc 


tgctgatcag 


300 


cagtgaactt 


tgggcgctga 


cggggaggtt 


tcgctacgtg 


acagtaccac 


gagaaggctc 


360 


aaaggtccca 


tttattgtac 


tcttcagcgt 


tttcgctggg 


tatttaaacg 


ctgtcagatc 


420 


atcaagaggc 


cactcttgag 


tgctggcgag 


aagagttttc 


tcctccgtgc 


tgccacgatg 


480 


aggctggtcc 


ccgagatgta 


cggtgttttt 


agcgacgaga 


cggtgcgtaa 


ctcagatgac 


540 


ctgctgaatt 


cagacgcgct 


ggaaatttcc 


aattcgcctg 


tgctttcgcc 


gccgtcactt 


600 


cacgacctgt 


ttgtgttttg 


gctcaacgct 


tagcaacgtg 


ttatataggg 


tcaagaagga 


660 


gcaggagacg 


cagtttgcta 


ggctgttggc 


cgatactcct 


ggagtttttg 


tggctctgga 


720 


tctaggccat 


cactctcttt 


tccaagagaa 


aattatcaaa 


aacttaactt 


ttacgtctcc 


780 


tggtcgcacg 


gttgcttccg 


ctgcctttat 


tacctatatt 


ttggatcaat 


ggagcaacag 


840 


cgacagccac 


ctgtcgtggg 


agtacatgct 


ggattacatg 


tcgatggcgc 


tgtggagggc 


900 


catgctgcgg 


aggagggttt 


gcatttactt 


gcgggcgcag 


cctccgcggc 


tggaccgagt 


960 


ggaggaggag 


gacgagccgg 


gggagaccga 


gaacctgagg 


gccgggctgg 


accctccaac 


1020 


ggaggactag 


gtgctgagga 


tgatcccgaa 


gaggggacta 


gtggggctag 


gaagaagcaa 


1080 


aagactgagt 


ctgaacctcg 


aaactttttg 


aatgagttga 


ctgtgagttt 


gatgaatcgt 


1140 


cagcgtccgg 


agacaatttt 


cfcggtctgaa 


ttggaggagg 


aattcaggag 


gggggaactg 


1200 


aacctgctat 


acaagtatgg 


gtttgaacag 


ttaaaaactc 


actggttgga 


gccgtgggag 


1260 


gattttgaaa 


ccgccttgga 


cacttttgct 


aaagtggctc 


tgcggccgga 


taaggtttac 


1320 


actatccgcc 


gcactgttaa 


cataaagaag 


agtgtttatg 


ttataggcca 


tggagctctg 


1380 


gtgcaggtgc 


aaaccgtcga 


ccgggtggcc 


tttagttgcg 


gtatgcaaaa 


tctgggcccc 


1440 


ggggtgatag 


gcttaaatgg 


tgtaacattt 


cacaatgtaa 


ggtttactgg 


tgaaagtttt 


1500 


aacggctctg 


tgtttgcaaa 


taacacacag 


ctgacgctcc 


acggcgttta 


cttttttaac 


1560 


tttaataaca 


catgtgtgga 


gtcgtggggc 


agggtgtctt 


tgaggggctg 


ctgttttcac 


1620 


ggctgctgga 


aggcggtggt 


gggaagactt 


aaaagtgtaa 


catctgtaaa 


aaaatgcgtg 


1680 


tttgagcggt 


gtgtgttggc 


tttaactgtg 


gagggctgtg 


gacgcattag 


gaataatgcg 


1740 
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gecattaact 


10080 


attcgatgea 


aaacctaaaa 

yuywuyyyy 


aagttctacg 


ctegcaagat 


ctacaagagc 


ccttacgtgc 


10140 


ccatagacaa 


ggaggtgaag 


atagacagct 


tttacatgcg 


catggcgctg 


aaggtgctga 


10200 


egctgagega 


cgacctcggc 


gtgtaccgta 


acgacaagat 


ccacaaggcg 


gtgagegeca 


10260 


gccgccggcg 


ggagctgagc 


gacagggagc 


tgatgcacag 


ectgeagagg 


gcgctggcgg 


10320 
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gcgccgggga 


cgaggagcgc 


gaggcttact 


tcgacatggg 


agccgatctg 


cagtggcgtc 


10380 


ccagcgcgcg 


cgccttggag 


gcggcgggtt 


atcccgacga 


ggaggatcgg 


gacgatttgg 


10440 


aggaggcagg 


cgagtacgag 


gacgaagcct 


gaccgggcag 


gtgttgtttt 


agatgcagcg 


10500 


gccggcggac 


gggaccaccg 


cggatcccgc 


acttttggca 


tccatgcaga 


gtcaaccttc 


10560 


gggcgtgacc 


gcctccgatg 


actgggcggc 


ggccatggac 


cgcatcatgg 


cgctgaccac 


10620 


ccgcaacccc 


gaggctttta 


ggcagcaacc 


ccaggccaac 


cgtttttcgg 


ccatcttgga 


10680 


agcggtggtg 


ccgtcgcgca 


ccaacccgac 


gcacgagaaa 


gtcctgacta 


tcgtgaacgc 


10740 


cctggtagac 


agcaaggcca 


tccgccgtga 


cgaggcgggc 


ttgatttaca 


acgctctttt 


10800 


ggaacgcgtg 


gcgcgctaca 


acagcactaa 


cgtgcagacc 


aatctggacc 


gcctcaccac 


10860 


cgacgtgaag 


gaggcgctgg 


cgcagaagga 


gcggtttctg 


agggacagta 


atctgggctc 


10920 


tctggtggca 


ctgaacgcct 


tcctgagctc 


acagccggcc 


aacgtgcccc 


gcgggcagga 


10980 


ggattacgtg 


agcttcatca 


gcgctctgag 


actgctggtg 


tccgaggtgc 


cccagagcga 


11040 


ggtgtaccag 


tctgggccgg 


attacttttt 


ccagacgtcc 


cgacagggct 


tgcaaacggt 


11100 


gaacctgact 


caggccttta 


aaaacttgca 


aggcatgtgg 


ggggtcaagg 


ccccggtggg 


11160 


cgatcgcgcc 


actatctcca 


gtctgctgac 


ccccaacact 


cgcctgctgc 


tgctcttgat 


11220 


cgcaccgttt 


accaacagta 


gcactatcag 


ccgtgactcg 


tacctgggtc 


atctcatcac 


11280 


tctgtaccgc 


gaggccatcg 


gccaggctca 


gatcgacgag 


catacgtatc 


aggagattac 


11340 


taacgtgagc 


cgtgccctgg 


gtcaggaaga 


taccggcagc 


ctggaagcca 


cgttgaactt 


11400 


tttgctaacc 


aaccggaggc 


aaaaaatacc 


ctcccagttc 


acgttaagcg 


ccgaggagga 


11460 


gaggattctg 


cgatacgtgc 


agcagtccgt 


gagcctgtac 


ttgatgcgcg 


agggcgccac 


11520 


cgcttccacg 


gctttagaca 


tgacggctcg 


gaacatggaa 


ccgtcctttt 


actccgccca 


11580 


ccggccgttc 


attaaccgtc 


tgatggacta 


cttccatcgc 


gcggccgcca 


tgaacgggga 


11640 


gtacttcacc 


aatgccatcc 


tgaatccgca 


ttggatgccc 


ccgtccggct 


tctacaccgg 


11700 


ggagtttgac 


ctgcccgaag 


ccgacgacgg 


ctttctgtgg 


gacgacgtgt 


ccgatagcat 


11760 


tttcacgccg 


gctaatcgcc 


gattccagaa 


gaaggagggc 


ggagacgagc 


tccccctctc 


11820 


cagcgtggaa 


gcggcctcaa 


ggggagagag 


tccctttcca 


agtctgtctt 


ccgccagtag 


11880 


cggtcgggta 


acgcgtccac 


ggttgccggg 


ggagagcgac 


tacctgaacg 


accccttgct 


11940 


gcgaccggct 


agaaagaaaa 


attttcccaa 


taacggggtg 


gaaagcttgg 


tggataaaat 


12000 
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gaatcgttgg aagacgtacg cccaggagca gcgggagtgg gaggacagtc agccgcggcc 12060 

gctggtaccg ccgcattggc gtcgccagag agaagacccg gacgactccg cagacgatag 12120 

tagcgtgttg gacctgggag ggagcggagc caaccccttt gctcacttgc aacccaaggg 12180 

gcgctcgagt cgcctgtatt aataaaaaag acgcggaaac ttaccagagc catggccaca 12240 

gcgtgtgtgc tttcttcctc tctttcttcc tcggcgcggc aga atg aga aga gcg 12295 

Met Arg Arg Ala 
1 



gtg aga gtc acg ccg gcg gcg tat gag ggc ccg ccc cct tct tac gaa 
Val Arg Val Thr Pro Ala Ala Tyr Glu Gly Pro Pro Pro Ser Tyr Glu 
5 10 15 20 



12343 



age gtg atg gga tea gcg aac gtg ccg gec acg ctg gag gcg cct tac 12391 

Ser Val Met Gly Ser Ala Asn Val Pro Ala Thr Leu Glu Ala Pro Tyr 

25 30 35 

gtt cct ccc aga tac ctg gga cct acg gag ggc aga aac age ate cgt 12439 

Val Pro Pro Arg Tyr Leu Gly Pro Thr Glu Gly Arg Asn Ser lie Arg 
40 45 50 



tac tec gag ctg gcg ccc ctg tac gat acc ace aag gtg tac ctg gtg 
Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Lys Val Tyr Leu Val 
55 60 65 



12487 



gac aac aag teg gcg gac ate gee tec ctg aat tac caa aac gat cac 12535 
Asp Asn Lys Ser Ala Asp lie Ala Ser Leu Asn Tyr Gin Asn Asp His 
70 75 80 

agt aac ttt ctg act acc gtg gtg cag aac aat gac ttc acc ccg acg 12583 
Ser Asn Phe Leu Thr Thr Val Val Gin Asn Asn Asp Phe Thr Pro Thr 
85 90 95 100 



gag gcg ggc acg cag acc att aac ttt gac gag cgt tec cgc tgg ggc 
Glu Ala Gly Thr Gin Thr lie Asn Phe Asp Glu Arg Ser Arg Trp Gly 
105 110 " 115 



12631 



ggt cag ctg aaa acc ate ctg cac acc aac atg ccc aac ate aac gag 
Gly Gin Leu Lys Thr lie Leu His Thr Asn Met Pro Asn lie Asn Glu 
120 125 130 



12679 



ttc atg tec acc aac aag ttc agg get aag ctg atg gta gaa aaa agt 
Phe Met Ser Thr Asn Lys Phe Arg Ala Lys Leu Met Val Glu Lys Ser 
135 140 145 



12727 



aat gcg gaa act egg cag ccc cga tac gag tgg ttc gag ttt acc att 
Asn Ala Glu Thr Arg Gin Pro Arg Tyr Glu Trp Phe Glu Phe Thr lie 
150 155 160 



12775 



cca gag ggc aac tat tec gaa act atg act ate gat etc atg aat aac 
Pro Glu Gly Asn Tyr Ser Glu Thr Met Thr lie Asp Leu Met Asn Asn 
165 170 175 180 



12823 
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gcg ate gtg gac aat tac ctg caa gtg ggg aga cag aac ggg gtg ctg 12871 
Ala lie Val Asp Asn Tyr Leu Gin Val Gly Arg Gin Asn Gly Val Leu 
185 190 195 

gaa age gat ate ggc gtg aaa ttc gat acc aga aac ttc cga ctg ggg 12919 
Glu Ser Asp lie Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly 
200 205 ~ 210 

tgg gat ccc gtg acc aag ctg gtg atg cca ggc gtg tac acc aac gag 12967 
Trp Asp Pro Val Thr Lys Leu Val Met Pro Gly Val Tyr Thr Asn Glu 
215 220 ~ 225 

get ttt cac ccg gac ate gtg ctg ctg ccg ggg tgc ggt gtg gac ttc 13015 
Ala Phe His Pro Asp He Val Leu Leu Pro Gly Cys Gly Val Asp Phe 
230 235 240 

act cag age cgt ttg agt aac ctg tta gga att aga aag cgc cgc ccc 13063 
Thr Gin Ser Arg Leu Ser Asn Leu Leu Gly He Arg Lys Arg Arg Pro 
245 250 255 260 

ttc caa gag ggc ttt caa ate atg tat gag gac ctg gag gga ggt aat 13111 
Phe Gin Glu Gly Phe Gin He Met Tyr Glu Asp Leu Glu Gly Gly Asn 
265 270 275 

ata ccc gee tta ctg gac gtg teg aag tac gaa get age ata caa cgc 13159 
He Pro Ala Leu Leu Asp Val Ser Lys Tyr Glu Ala Ser He Gin Arg 
280 285 290 

gec aaa gcg gag ggt aga gag att egg gga gac acc ttt gcg gta get 13207 
Ala Lys Ala Glu Gly Arg Glu He Arg Gly Asp Thr Phe Ala Val Ala 
295 300 305 

ccc cag gac ctg gaa ata gtg cct tta act aaa gac age aaa gac aga 13255 
Pro Gin Asp Leu Glu He Val Pro Leu Thr Lys Asp Ser Lys Asp Arg 
310 315 " 320 

age tac aat att ata aac aac acg acg gac acc ctg tat egg age tgg 13303 
Ser Tyr Asn He He Asn Asn Thr Thr Asp Thr Leu Tyr Arg Ser Trp 
325 330 335 340 

ttt ctg get tac aac tac gga gac ccc gag aaa gga gtg aga tea tgg 13351 
Phe Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp 
345 350 355 

acc ata etc acc acc acg gac gtg acc tgt ggc teg cag caa gtg tac 13399 
Thr He Leu Thr Thr Thr Asp Val Thr Cys Gly Ser Gin Gin Val Tyr 
360 365 370 

tgg tec ctg ccg gat atg atg caa gac ccg gtc acc ttc cgc ccc tec 13447 
Trp Ser Leu Pro Asp Met Met Gin Asp Pro Val Thr Phe Arg Pro Ser 
375 380 385 

acc caa gtc age aac ttc ccg gtg gtg ggc acc gag ctg ctg ccc gtc 13495 
Thr Gin Val Ser Asn Phe Pro Val Val Gly Thr Glu Leu Leu Pro Val 
390 395 400 
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cat gcc aag age ttc tac aac gag cag gec gtc tac teg caa ctt att 13543 
His Ala Lys Ser Phe Tyr Asn Glu Gin Ala Val Tyr Ser Gin Leu lie 
405 410 415 420 

cgc cag tec ace gcg ctt acc cac gtg ttc aat cgc ttt ccc gag aac 13591 
Arg Gin Ser Thr Ala Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn 
425 430 435 

cag att ctg gtg cgc cct ccc get cct acc att acc acc gtc agt gaa 13639 
Gin lie Leu Val Arg Pro Pro Ala Pro Thr lie Thr Thr Val Ser Glu 
440 445 450 

aac gtt ccc gcc etc aca gat cac gga acc ctg ccg ctg cgc age agt 13687 
Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser 
455 460 465 

ate agt gga gtt cag cgc gtg acc ate acc gac gcc aga cgt cga acc 13735 
lie Ser Gly Val Gin Arg Val Thr lie Thr Asp Ala Arg Arg Arg Thr 
470 475 480 

tgc ccc tac gtt tac aaa gcg ctt ggc gtg gtg get cct aaa gtt ctt 13783 
Cys Pro Tyr Val Tyr Lys Ala Leu Gly Val Val Ala Pro Lys Val Leu 
485 ~ 490 ~ 495 500 

tct agt cgc acc ttc taa aaacatgtcc atcctcatct ctcccgataa 13831 
Ser Ser Arg Thr Phe 
505 



caacaccggc 


tggggactgg 


gctccggcaa 


gatgtaegge 


ggagccaaaa 


ggcgctccag 


13891 


tcagcaccca 


gttcgagttc 


ggggecaett 


ccgcgctcct 


tggggagctt 


acaagegagg 


13951 


actctegggt 


egaaeggctg 


tagacgatac 


catagatgee 


gtgattgccg 


acgcccgccg 


14011 


gtacaacccc 


ggaceggteg 


ctagcgccgc 


ctccaccgtg 


gattccgtga 


tegacagegt 


14071 


ggtagcegge 


gctcgggcct 


atgctcgccg 


caagaggegg 


ctgcatcgga 


gacgtcgccc 


14131 


caccgccgcc 


atgctggcag 


ccagggccgt 


getgaggegg 


geceggaggg 


caggcagaag 


14191 


ggctatgege 


cgcgctgccg 


ccaacgccgc 


egcegggagg 


gcccgccgac 


aggctgcccg 


14251 


ccaggctgcc 


gctgccatcg 


ctagcatggc 


cagacccagg 


agagggaacg 


tgtactgggt 


14311 


gcgtgattct 


gtgaegggag 


tccgagtgcc 


ggtgcgcagc 


cgacctcccc 


gaagttagaa 


14371 


gatccaagct 


gegaagaegg 


eggtactgag 


tctccctgtt 


gttatcagee 


caacatgagc 


14431 


aagcgcaagt 


ttaaagaaga 


actgetgeag 


acgctggtgc 


ctgagatcta 


tggccctccg 


14491 


gaegtgaage 


cagacattaa 


gccccgcgat 


ateaagegtg 


ttaaaaagcg 


ggaaaagaaa 


14551 


gaggaactcg 


cggtggtaga 


cgatggcgga 


gtggaattta 


ttaggagttt 


cgccccgcga 


14611 


cgcagggttc 


aatggaaagg 


gcggcgggta 


caacgcgttt 


tgaggccggg 


caccgcggta 


14671 


gtttttaccc 


egggagageg 


gtcggccgtt 


aggggtttca 


aaaggcagta 


cgacgaggtg 


14731 
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tacggcgacg 


aggacatatt 


ggaacaggcg 


gctcaacaga 


tcggagaatt 


tgcctacgga 


14791 


aagcgttcgc 


gtcgcgaaga 


cctggccatc 


gccttagaca 


gcggcaaccc 


cacgcccagc 


14851 


ctcaaacccg 


tgacgctgca 


gcaggtgctt 


cccgtgagcg 


ccagcacgga 


cagcaagagg 


14911 


gggattaaga 


gagaaatgga 


agatctgcat 


cccaccatcc 


aactcatggt 


ccctaaacgg 


14971 


cagaggctgg 


aagaggtcct 


ggagaagatg 


aaagtggacc 


ccagcataga 


gccggatgta 


15031 


aaagtcagac 


ctattaagga 


agtggccccc 


ggtcttgggg 


tgcaaacggt 


ggacattcaa 


15091 


atccccgtca 


ccaccgcttc 


aaccgccgtg 


gaagctatgg 


aaacgcaaac 


ggagacccct 


15151 


gccgcgatcg 


gtaccaggga 


agtggcgttg 


caaacggagc 


cttggtacga 


atacgcagcc 


15211 


cctcggcgtc 


agaggcgttc 


cgctcgttac 


ggccccgcca 


acgccatcat 


gccagaatat 


15271 


gcgctgcatc 


cgtctattct 


gcccactccc 


ggataccggg 


gtgtgacgta 


tcgcccgtct 


15331 


ggaacccgcc 


gccgaacccg 


tcgccgccgc 


cgctcccgtc 


gcgctctggc 


ccccgtgtcg 


15391 


gtgcggcgtg 


tgacccgccg 


gggaaagaca 


gtcgtcattc 


ccaacccgcg 


ttaccaccct 


15451 


agcatccttt 


aataactctg 


ccgttttgca 


gatggctctg 


acttgccgcg 


tgcgccttcc 


15511 


cgttccgcac 


tatcgaggaa 


gatctcgtcg 


taggagaggc 


atgacgggca 


gtggtcgccg 


15571 


gcgggctttg 


cgcaggcgca 


tgaaaggcgg 


aattttaccc 


gccctgatac 


ccataattgc 


15631 


cgccgccatc 


ggtgccatac 


ccggcgttgc 


ttcagtggcg 


ttgcaagcag 


ctcgtaataa 


15691 


ataaacaaag 


gcttttgcac 


ttatgacctg 


gtcctgacta 


ttttatgcag 


aaagagcatg 


15751 


gaagacatca 


attttacgtc 


gctggctccg 


cggcacggct 


cgcggccgct 


catgggcacc 


15811 


tggaacgaca 


tcggcaccag 


tcagctcaac 


gggggcgctt 


tcaattgggg 


gagcctttgg 


15871 


agcggcatta 


aaaactttgg 


ctccacgatt 


aaatcctacg 


gcagcaaagc 


ctggaacagt 


15931 


agtgctggtc 


agatgctccg 


agataaactg 


aaggacacca 


acttccaaga 


aaaagtggtc 


15991 


aatggggtgg 


tgaccggcat 


ccacggcgcg 


gtagatctcg 


ccaaccaagc 


ggtgcagaaa 


16051 


gagattgaca 


ggcgtttgga 


aagctcgcgg 


gtgccgccgc 


agagagggga 


tgaggtggag 


16111 


gtcgaggaag 


tagaagtaga 


ggaaaagctg 


cccccgctgg 


agaaagttcc 


cggtgcgcct 


16171 


ccgagaccgc 


agaagcggcc 


caggccagaa 


ctagaagaga 


ctctggtgac 


ggagagcaag 


16231 


gagcctccct 


cgtacgagca 


agccttgaaa 


gagggcgcct 


ctccaccctc 


ctacccgatg 


16291 


actaagccga 


tcgcacccat 


ggctcgaccg 


gtgtacggca 


aggattacaa 


gcccgtcacg 


16351 


ctagagctgc 


ccccaccgcc 


ccccacgcgc 


ccgaccgtcc 


cccccctgcc 


gactccgtcg 


16411 
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gcggccgcgg cgggacccgt gtccgcacca tccgctgtgc ctctgccagc cgcccgtcca 16471 

gtggccgtgg ccactgccag aaaccccaga ggccagagag gagccaactg gcaaagcacg 16531 

ctgaacagca tcgtgggcct gggagtgaaa agcctgaaac gccgccgttg ctattattaa 16591 

aaaagtgtag ctaaaaagtc tcccgttgta tacgcctcct atgttaccgc cagagacgag 16651 

tgactgtcgc cgcgagcgcc gctttcaag atg gcc acc cca teg atg atg ccg 16704 

Met Ala Thr Pro Ser Met Met Pro 
510 



cag tgg tct tac atg cac ate gcc ggc cag gac gcc teg gag tac ctg 16752 
Gin Trp Ser Tyr Met His lie Ala Gly Gin Asp Ala Ser Glu Tyr Leu 
515 "* 520 525 

agt ccc ggc etc gtg cag ttt gcc cgc gcc acc gac acc tac ttc age 16800 
Ser Pro Gly Leu Val Gin Phe Ala Arg Ala Thr Asp Thr Tyr Phe Ser 
530 ^ 535 540 545 

ttg gga aac aag ttt aga aac ccc acc gtg gcc ccc acc cac gat gtg 16848 
Leu Gly Asn Lys Phe Arg Asn Pro Thr Val Ala Pro Thr His Asp Val 
550 555 560 

acc acg gac cgc teg cag agg ctg acc ctg cgc ttt gtg ccc gta gac 16896 
Thr Thr Asp Arg Ser Gin Arg Leu Thr Leu Arg Phe Val Pro Val Asp 
565 570 575 

egg gag gac acc gcg tac tct tac aaa gtg cgc tac acg ttg gcc gta 16944 
Arg Glu Asp Thr Ala Tyr Ser Tyr Lys Val Arg Tyr Thr Leu Ala Val 
580 585 590 

ggg gac aac cga gtg ctg gac atg gcc age acc tac ttt gac ate egg 16992 
Gly Asp Asn Arg Val Leu Asp Met Ala Ser Thr Tyr Phe Asp He Arg 
595 600 605 

ggg gtg ctg gat egg ggt ccc age ttc aag ccc tat tec ggc acc get 17040 
Gly Val Leu Asp Arg Gly Pro Ser Phe Lys Pro Tyr Ser Gly Thr Ala 
610 615 620 625 

tac aac tec ctg gcc ccc aag gga get ccc aac ccc teg gaa tgg acg 17088 
Tyr Asn Ser Leu Ala Pro Lys Gly Ala Pro Asn Pro Ser Glu Trp Thr 
630 635 640 

gac act tec gac aac aaa ctt aaa gca tat get cag get ccc tac cag 17136 
Asp Thr Ser Asp Asn Lys Leu Lys Ala Tyr Ala Gin Ala Pro Tyr Gin 
645 650 655 

agt caa gga ctt aca aag gat ggt att cag gtt ggg eta gtt gtg aca 17184 
Ser Gin Gly Leu Thr Lys Asp Gly He Gin Val Gly Leu Val Val Thr 
660 665 670 

gag tea gga caa aca ccc caa tat gca aac aaa gtg tac caa ccc gag 17232 
Glu Ser Gly Gin Thr Pro Gin Tyr Ala Asn Lys Val Tyr Gin Pro Glu 
675 680 685 
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cca caa att ggg gaa aac caa tgg aat tta gaa caa gaa gat aaa gcg 17280 

Pro Gin lie Gly Glu Asn Gin Trp Asn Leu Glu Gin Glu Asp Lys Ala 

690 695 700 705 

gcg gga aga gtc eta aag aaa gat acc cct atg ttt ccc tgc tat ggg 17328 

Ala Gly Arg Val Leu Lys Lys Asp Thr Pro Met Phe Pro Cys Tyr Gly 

710 715 720 

tea tat gec agg ccc aca aac gaa caa gga ggg cag gca aaa aac caa 17376 

Ser Tyr Ala Arg Pro Thr Asn Glu Gin Gly Gly Gin Ala Lys Asn Gin 

725 730 735 

gaa gta gat tta cag ttt ttt gec act ccg ggc gac acc cag aac acg 17424 

Glu Val Asp Leu Gin Phe Phe Ala Thr Pro Gly Asp Thr Gin Asn Thr 

740 745 750 

get aaa gtg gta ctt tat get gaa aat gtc aac ctg gaa act cca gat 17472 

Ala Lys Val Val Leu Tyr Ala Glu Asn Val Asn Leu Glu Thr Pro Asp 

755 " 760 765 

act cac tta gtg ttt aaa ccc gat gac gac age acc agt tea aaa ctt 17520 

Thr His Leu Val Phe Lys Pro Asp Asp Asp Ser Thr Ser Ser Lys Leu 

770 775 780 785 

ctt ctt ggg cag cag get gca cct aac aga ccc aac tac ata ggt ttt 17568 

Leu Leu Gly Gin Gin Ala Ala Pro Asn Arg Pro Asn Tyr lie Gly Phe 

790 795 800 

aga gat aat ttt att ggt tta atg tac tac aat age act gga aac atg 17616 

Arg Asp Asn Phe lie Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met 

805 " 810 815 

ggc gtg ctg gec gga cag get tct caa ttg aat gee gta gtc gac ttg 17664 

Gly Val Leu Ala Gly Gin Ala Ser Gin Leu Asn Ala Val Val Asp Leu 

820 825 830 

cag gac aga aac acc gag ttg tec tac cag ctg atg ctg gac gca ctg 17712 

Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin Leu Met Leu Asp Ala Leu 

835 840 845 

ggg gat cgc age cga tat ttt tea atg tgg aat cag gca gta gac age 17760 

Gly Asp Arg Ser Arg Tyr Phe Ser Met Trp Asn Gin Ala Val Asp Ser 

850 " 855 860 865 

tat gac cca gac gtt aga att ata gaa aac cac gga gtg gaa gac gaa 17808 

Tyr Asp Pro Asp Val Arg lie lie Glu Asn His Gly Val Glu Asp Glu 

870 875 880 

ctg cca aac tat tgt ttt cct ctg gga gga atg gtg gtg act gac aat 17856 

Leu Pro Asn Tyr Cys Phe Pro Leu Gly Gly Met Val Val Thr Asp Asn 

885 890 895 

tac aac tct gtg acg cct caa aat gga ggc agt gga aat aca tgg cag 17904 

Tyr Asn Ser Val Thr Pro Gin Asn Gly Gly Ser Gly Asn Thr Trp Gin 

900 905 "* 910 
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gca gac aat act aca ttt agt caa aga gga gcg cag att ggc tec gga 
Ala Asp Asn Thr Thr Phe Ser Gin Arg Gly Ala Gin lie Gly Ser Gly 
915 920 925 



17952 



aac atg ttt gec ctg gaa att aac eta cag gec aac etc tgg cgc ggc 
Asn Met Phe Ala Leu Glu lie Asn Leu Gin Ala Asn Leu Trp Arg Gly 
930 935 940 945 



18000 



ttc ttg tat tec aat att ggg ttg tat ctt cca gac tct ctg aaa ate 
Phe Leu Tyr Ser Asn lie Gly Leu Tyr Leu Pro Asp Ser Leu Lys lie 
950 955 960 



18048 



ace ccc gac aac ate acg ctg cca gaa aac aaa aac act tat cag tac 
Thr Pro Asp Asn lie Thr Leu Pro Glu Asn Lys Asn Thr Tyr Gin Tyr 
965 970 975 



18096 



atg aac ggt cgc gta acg cca ccc ggg etc ata gac ace tat gta aac 
Met Asn Gly Arg Val Thr Pro Pro Gly Leu lie Asp Thr Tyr Val Asn 
980 985 990 



18144 



gtg ggc gcg cgc tgg tec ccc gat gtc atg gac age att aac ccc ttc 
Val Gly Ala Arg Trp Ser Pro Asp Val Met Asp Ser lie Asn Pro Phe 
995 1000 1005 



18192 



aac cac cac cgt aac gcg ggc ttg cgc tac cgc tec atg etc ttg 
Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu Leu 
1010 1015 1020 



18237 



ggc aac ggc cgt tat gtg cct ttt cac att cag gtg ccc caa aaa 
Gly Asn Gly Arg Tyr Val Pro Phe His lie Gin Val Pro Gin Lys 
1025 " " 1030 1035 



18282 



ttc ttt gee att aaa aac ctg ctg ctt etc ccc ggt tec tat ace 
Phe Phe Ala lie Lys Asn Leu Leu Leu Leu Pro Gly Ser Tyr Thr 
1040 1045 1050 



18327 



tat gag tgg aac ttc cgc aag gat gtc aac atg ate ctg cag age 18372 
Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met He Leu Gin Ser 
1055 1060 1065 

teg ctg ggt aat gac ctg cga gtg gac ggg gec age ata cgc ttt 18417 
Ser Leu Gly Asn Asp Leu Arg Val Asp Gly Ala Ser He Arg Phe 
1070 * " 1075 1080 



gac age att aac ctg tat gee aac ttt ttt ccc atg gee cac aac 
Asp Ser He Asn Leu Tyr Ala Asn Phe Phe Pro Met Ala His Asn 
1085 1090 1095 



18462 



acg gee tct ace ctg gaa gee atg ctg cgc aac gac acc aat gac 
Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp 
1100 1105 1110 



18507 



cag tec ttc aac gac tac ctg tgc gcg get aac 
Gin Ser Phe Asn Asp Tyr Leu Cys Ala Ala Asn 
1115 1120 1125 



atg ctg tac ccc 
Met Leu Tyr Pro 



18552 
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ate ccc gee aac gee acc age gtg ccc att tct att cct tct egg 
He Pro Ala Asn Ala Thr Ser Val Pro He Ser He Pro Ser Arg 
1130 1135 1140 



18597 



aac tgg get gee ttc agg ggc tgg agt ttt act cgc etc aaa acc 
Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg Leu Lys Thr 
1145 1150 1155 



18642 



aag gag act ccc teg ctg 
Lys Glu Thr Pro Ser Leu 
1160 1165 



ggc tec ggt ttt gac ccc tac ttt gtt 
Gly Ser Gly Phe Asp Pro Tyr Phe Val 
1170 



18687 



tac tec ggc tec att ccc tac eta gat ggc acc 
Tyr Ser Gly Ser He Pro Tyr Leu Asp Gly Thr 
1175 1180 * 1185 



ttt tac etc aac 
Phe Tyr Leu Asn 



18732 



cac act ttc aaa aag gtg tct att atg ttt gac tec teg gtt age 
His Thr Phe Lys Lys Val Ser He Met Phe Asp Ser Ser Val Ser 
1190 " 1195 1200 



18777 



tgg ccc ggc aac gac cgc 
Trp Pro Gly Asn Asp Arg 
1205 1210 



ctg eta acg ccc aac 
Leu Leu Thr Pro Asn 
1215 



gag ttc gaa att 
Glu Phe Glu He 



18822 



aag cgt tec gtg gac ggt 
Lys Arg Ser Val Asp Gly 
1220 1225 



gaa ggg tac aac gtg gec cag age aac 
Glu Gly Tyr Asn Val Ala Gin Ser Asn 
1230 



18867 



atg acc aag gac tgg ttt eta att caa atg etc agt cac tat aat 
Met Thr Lys Asp Trp Phe Leu He Gin Met Leu Ser His Tyr Asn 
1235 1240 1245 



18912 



ata ggt tac cag ggc ttc tat gtg ccc gag aac tac aag gac cgc 
He Gly Tyr Gin Gly Phe Tyr Val Pro Glu Asn Tyr Lys Asp Arg 
1250 ^ ^ ^ 1255 1260 



18957 



atg tac tec ttc ttc cgc aac ttc caa cca atg age egg cag gtg 
Met Tyr Ser Phe Phe Arg Asn Phe Gin Pro Met Ser Arg Gin Val 
1265 ~ 1270 1275 



19002 



gta gat acc gtg act tat aca gac tac aaa gat gtc aag etc ccc 
Val Asp Thr Val Thr Tyr Thr Asp Tyr Lys Asp Val Lys Leu Pro 
1280 1285 1290 



19047 



tac caa cac aac aac tea ggg ttc gtg ggc tac atg gga ccc acc 
Tyr Gin His Asn Asn Ser Gly Phe Val Gly Tyr Met Gly Pro Thr 
1295 1300 1305 



19092 



atg cga gag gga cag gee tac ccg gee aac tat ccc tac ccc ctg 
Met Arg Glu Gly Gin Ala Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu 
1310 1315 1320 



19137 



ate gga gag act gee gta ccc age etc acg cag 
He Gly Glu Thr Ala Val Pro Ser Leu Thr Gin 
1325 1330 1335 



aaa aag ttc etc 
Lys Lys Phe Leu 



19182 
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tgc gac egg gtg atg tgg agg ata ccc ttc tct age aac ttt atg 19227 
Cys Asp Arg Val Met Trp Arg lie Pro Phe Ser Ser Asn Phe Met 
1340 * 1345 1350 

teg atg ggc tec etc ace gac ctg ggg cag aac atg ctg tac gec 19272 
Ser Met Gly Ser Leu Thr Asp Leu Gly Gin Asn Met Leu Tyr Ala 
1355 * 1360 1365 

aac tec get cac gec ttg gac atg act ttt gag gtg gat ccc atg 19317 
Asn Ser Ala His Ala Leu Asp Met Thr Phe Glu Val Asp Pro Met 
1370 1375 1380 

gat gag ccc acg ctt etc tat gtt ctg ttt gaa gtc ttc gac gtg 19362 
Asp Glu Pro Thr Leu Leu Tyr Val Leu Phe Glu Val Phe Asp Val 
1385 1390 1395 

gtg cgc ate cac cag ccg cac cgc ggc gtc ate gag gec gtc tac 19407 
Val Arg He His Gin Pro His Arg Gly Val He Glu Ala Val Tyr 
1400 1405 1410 

ctg cgc aca cct ttc tct gec ggt aac gee ace ace taa agaagctgat 19456 
Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr 
1415 1420 1425 



y y y L LCCa^C 


yctctc-ciyyciy u 


i_ y d y y uua l» 


fcerfctecrccrac 


ct.aaoct.Qca 


aaccctcrctt 


19516 


4-4-4- nrrrr^a r«r 
l. l. l. y y y uau^ 


4~+*rTra faarff 
w Luy aLaaij u» 


ntt* +" t c c* c era 


att cater tec 


ccccacaagc 


ccrcfcctcfcac 


19576 


categttaac 


acggccggac 


gggagacagg 


gggggtgcac 


tggctcgcct 


tcgcctggaa 


19636 


cccgcgcaac 


cgcacctgct 


acctgttcga 


cccttttggt 


ttctccgacg 


aaaggctgaa 


19696 


gcagatctac 


caattcgagt 


acgaggggct 


cctcaagcgc 


agegctctgg 


cctccacgcc 


19756 


cgaccactgc 


gtcaccctgg 


aaaagtccac 


ccagacggtc 


caggggcccc 


tctcggccgc 


19816 


ctgegggett 


ttctgttgca 


tgtttttgca 


cgccttcgtg 


cactggcctc 


acacccccat 


19876 


ggagegcaac 


cccaccatgg 


atctgctcac 


cggagtgccc 


aacagcatgc 


ttcacagtcc 


19936 


ccaggtcgcc 


cccaccctgc 


gtegcaatea 


ggaccacctg 


tategcttte 


tggggaaaca 


19996 


ctctgcctat 


ttccgccgcc 


accggcagcg 


catcgaacag 


gccacggcct 


tcgaaagcat 


20056 


gagecaaaga 


gtgtaatcaa 


taaaaaccgt 


ttttatttga 


catgataege 


gcttctggcg 


20116 


tttttattaa 


aaatcgaagg 


gttcgaggga 


ggggtcctcg 


tgcccgctgg 


ggagggacac 


20176 


gttgeggtae 


tggaatcggg 


cgctccaacg 


aaactcgggg 


atcaccagcc 


geggcaggge 


20236 


cacgtcttcc 


atgttctgct 


tccaaaactg 


tcgcaccagc 


tgeagggetc 


ccatcacgtc 


20296 


gggegctgag 


atcttgaagt 


cgcagttagg 


gccggagccc 


ccgcggctgt 


tgeggaacac 


20356 


ggggttggca 


cactggaaca 


ccaacacgct 


ggggttgtgg 


atactageca 


gggcegtegg 


20416 


gtcggtcacc 


tecgatgeat 


ccagatcctc 


ggcattgetc 


agggegaacg 


gggtcagctt 


20476 
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gca'catctgc 


cgcccgatct 


ggggtaccag 


gtcgcgcttg 


ttgaggcagt 


cgcagcgcag 


20536 


agggatgagg 


atgcgacgct 


gcccgcgttg 


catgatgggg 


taactcgccg 


ccaggaactc 


20596 


ctctatctga 


cggaaggcca 


tctgggcctt 


gacgccctcg 


gtgaaaaata 


gcccacagga 


20656 


cttgctggaa 


aacacgttat 


tgccacagtt 


gatgtcttcc 


gcgcagcagc 


gcgcatcttc 


20716 


gttcttcagc 


tgaaccacgt 


tgcgacccca 


gcggttctga 


accaccttgg 


ctttcgtggg 


20776 


atgctccttc 


agcgcccgct 


gtccgttctc 


gctggtcaca 


tccatttcca 


ccacgtgctc 


20836 


cttgcagacc 


atctccactc 


cgtggaaaca 


gaacagaatg 


ccctcctgtt 


gggtattgcg 


20896 


atgctcccac 


acggcgcacc 


cggtggactc 


ccagctcttg 


tgtttcaccc 


ccgcgtaggc 


20956 


ttccatgtaa 


gccattagaa 


atctgcccat 


cagctcagtg 


aaggtcttct 


ggttggtgaa 


21016 


ggttagcggc 


aggccgcggt 


gttcctcgtt 


caaccaagtt 


tgacagatct 


tgcggtacac 


21076 


ggctccctgg 


tcgggcagaa 


acttaaaagt 


cgttctgctc 


tcgttgtcca 


cgtggaactt 


21136 


ctccatcaac 


atcgtcatga 


cttccatgcc 


cttctcccag 


gcagtcacca 


gcggcgcgct 


21196 


ctcggggttc 


ttcaccaaca 


cggcggtgga 


ggggccctcg 


ccggccccga 


cgtccttcat 


21256 


ggacattttt 


tgaaactcca 


cggtgccgtc 


cgcgcggcgt 


actctgcgca 


tcggagggta 


21316 


gctgaagccc 


acctccatga 


cggtgctttc 


gccctcgctg 


tcggagacga 


tctccgggga 


21376 


gggcggcgga 


acgggggcag 


acttgcgagc 


cttcttcttg 


ggagggagcg 


gaggcacctc 


21436 


ctgctcgcgc 


tcgggactca 


tctcccgcaa 


gtagggggtg 


atggagcttc 


ctggttggtt 


21496 


ctgacggttg 


gccattgtat 


cctaggcaga 


aagacatgga 


gcttatgcgc 


gaggaaactt 


21556 


taaccgcccc 


gtcccccgtc 


agcgacgaag 


aggtcatcgt 


cgaacaggac 


ccgggctacg 


21616 


ttacgccgcc 


cgaggatctg 


gaggggccct 


tagacgaccg 


gcgcgacgct 


agtgagcggc 


21676 


aggaaaatga 


gaaagaggag 


gaggagggct 


gctacctcct 


ggaaggcgac 


gttttgctaa 


21736 


agcatttcgc 


caggcagagc 


accatactca 


aggaggcctt 


gcaagaccgc 


tccgaggtgc 


21796 


ccttggacgt 


cgccgcgctc 


tcccaggcct 


acgaggcgaa 


ccttttctcg 


ccccgagtgc 


21856 


ctccgaagag 


acagcccaac 


ggcacctgcg 


agcccaaccc 


gcgactcaac 


ttctaccccg 


21916 


tgttcgccgt 


gcccgaggcg 


ctggccacct 


accacatctt 


tttcaaaaac 


cagcgcattc 


21976 


ccctttcctg 


ccgggccaac 


cgcaccgcgg 


ccgataggaa 


gctaacactc 


agaaacggag 


22036 


tcagcatacc 


tgatatcacg 


tcactggagg 


aagtgcctaa 


gatcttcgag 


ggtctgggtc 


22096 


gagatgagaa 


gcgggcggcg 


aacgctctgc 


agaaagaaca 


gaaagagagt 


cagaacgtgc 


22156 
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tggtggagct 


ggagggggac 


aacgcgcgtc 


tgaccgtcct 


caaacgttgc 


atagaagttt 


22216 


cccacttcgc 


ctacccggcc 


ctcaacctgc 


cgcccaaagt 


tatgaaatcg 


gtcatggacc 


22276 


agctactcat 


caagagagct 


gagcccctga 


atcccgacca 


ccctgaggcg 


gaaaactcag 


22336 


aggacggaaa 


gcccgtcgtc 


agcgacgagg 


agctcgagcg 


gtggctggaa 


accagggacc 


22396 


cccagcagtt 


gcaagagagg 


cgcaagatga 


tgatggcggc 


cgtgctggtc 


acggtggagc 


22456 


tagaatgcct 


gcaacggttt 


ttcagcgacg 


tggagacgct 


acgcaaaatc 


ggggagtccc 


22516 


tgcactacac 


cttccgccag 


ggctacgttc 


gccaggcctg 


caaaatctcc 


aacgtagagc 


22576 


tcagcaacct 


ggtttcctac 


atgggcatcc 


tccacgagaa 


ccggctgggg 


cagagcgtgc 


22636 


tgcactgcac 


cttgcaaggc 


gaggcgcgaa 


gggactacgt 


ccgagactgc 


gtctacctct 


22696 


tcctcaccct 


cacctggcag 


accgccatgg 


gcgtgtggca 


gcagtgcttg 


gaagagagaa 


22756 


acctcaaaga 


gctggacaaa 


ctcctctgcc 


gccagcggcg 


ggccctctgg 


accggcttca 


22816 


gcgagcgcac 


ggtcgcctgc 


gccctggcag 


acatcatttt 


cccagaacgc 


ctgatgaaaa 


22876 


ccttgcagaa 


cggcctgccg 


gatttcatca 


gtcagagcat 


cttgcaaaac 


ttccgctcct 


22936 


tcgtcctgga 


gcgctccggg 


atcttgcccg 


ccatgagctg 


cgcgctgcct 


tctgactttg 


22996 


tccccctttc 


ctaccgcgag 


tgccctcccc 


cactgtggag 


ccactgctac 


ctcttccaac 


23056 


tggccaactt 


tctggcctac 


cactccgacc 


tcatggaaga 


cgtgagcgga 


gaggggctgc 


23116 


tcgagtgcca 


ctgccgctgc 


aacctctgca 


ccccccacag 


atcgctggcc 


tgcaacaccg 


23176 


agctgctcag 


cgaaacccag 


gtcataggta 


ccttcgagat 


ccaggggccc 


cagcagcaag 


23236 


agggtgcttc 


cggcttgaag 


ctcactccgg 


cgctgtggac 


ctcggcttac 


ttacgcaaat 


23296 


ttgtagccga 


ggactaccac 


gcccacaaaa 


ttcagtttta 


cgaagaccaa 


tctcgaccac 


23356 


cgaaagcccc 


cctcacggcc 


tgcgtcatca 


cccagagcaa 


aatcctggcc 


caattgcaat 


23416 


ccatcaacca 


agcgcgccga 


gatttccttt 


tgaaaaaggg 


tcggggggtg 


tacctggacc 


23476 


cccagaccgg 


cgaggaactc 


aacccgtcca 


cactttccgt 


cgaagcagcc 


cccccgagac 


23536 


atgccaccca 


agggaaccgc 


caagcagctg 


atcgctcggc 


agagagcgaa 


gaagcaagag 


23596 


ctgctccagc 


agcaggtgga 


ggacgaggaa 


gagctgtggg 


acagccaggc 


agaggaggtg 


23656 


tcagaggacg 


aggaggagat 


ggaaagctgg 


gacagcctag 


acgaggagga 


cgagctttca 


23716 


gaggaagagg 


cgaccgaaga 


aaaaccacct 


gcatccagcg 


cgccttcuCt 


gagccgacag 


9^776 

Z.O 1 / D 


ccgaagcccc 


ggcccccgac 


gcccccggcc 


ggctcactca 


aagccagccg 


taggtgggac 


23836 


gccaccggat 


ctccagcggc 


agcggcaacg 


gcagcgggta 


aggccaaacg 


cgagcggcgg 


23896 



168 



WO 03/046124 



PCIYUS02/33645 



gggtattgct 


cctggcggac ccacaaaagc 


agtatcgtga actgcttgca 


acactgeggg 


23956 


ggaaacatct 


cctttgcccg acgctacctc 


ctcttccatc acggtgtggc 


cttccctcgc 


24016 


aacgttctct 


attattaccg tcatctctac 


agcccctacg aaacgctcgg 


agaaaaaagc 


24076 


taaggcctcc 


tctgccgcga ggaaaaactc 


cgccgccgct gccgccaagg 


atccgccggc 


24136 


caccgaggag 


ctgagaaagc gcatctttcc 


cactctgtat gctatctttc 


ageaaagecg 


24196 


cgggcagcac 


cctcagcgcg aactgaaaat 


aaaaaaccgc tccttccgct 


cactcacccg 


24256 


cagctgtctg 


taccacaaga gagaagacca 


gctgcagcgc accctggacg 


acgccgaagc 


24316 


actgttcagc 


aaatactgct cagcgtctct 


taaagactaa aagacccgcg 


ctttttcccc 


24376 


ctcgggcgcc 


aaaacccacg tcatcgccag 


catgagcaag gagattccca 


ccccttacat 


24436 


gtggagctat 


cagccccaga tgggcctggc 


cgcgggggcc gcccaggact 


actccagcaa 


24496 


aatgaactgg 


ctcagcgccg gcccccacat 


gatctcacga gttaacggca 


tccgagccca 


24556 


ccgaaaccag 


atcctcttag aacaggcggc 


aatcaccgcc acaccccggc 


gccaactcaa 


24616 


cccgcccagt 


tggcccgccg cccaggtgta 


tcaggaaact ccccgcccga 


ccacagtcct 


24676 


cctgccacgc 


gacgcggagg ccgaagtcct 


catgactaac tctggggtac 


aattagcggg 


24736 


cgggtccagg 


tacgccaggt acagaggtcg 


ggccgctcct tactctcccg 


ggagtataaa 


24796 


gagggtgatc 


attcgaggcc gaggtatcca 


gctcaacgac gaggcggtga 


gctcctcaac 


24856 


cggtctcaga 


cctgacggag tcttccagct 


cggaggagcg ggccgctctt 


ccttcaccac 


24916 


tcgccaggcc 


tacctgaccc tgcagagctc 


ttcctcgcag ccgcgctccg 


ggggaategg 


24976 


cactctccag 


ttcgtggaag agttcgtccc 


ctccgtctac ttcaacccgt 


tttccggctc 


25036 


acctggacgc 


tacccggacg ccttcattcc 


caactttgac gcagtgagtg 


aatccgtgga 


25096 


cggctacgac 


tgatgacaga tggtgcggcc 


gtgagagctc ggctgcgaca 


tctgcatcac 


25156 


tgccgccagc 


ctcgctgcta cgctcgggag 


gcgatcgtgt tcagctactt 


tgagctgccg 


25216 


gacgagcacc 


ctcagggacc ggctcacggg 


ttgaaactcg agattgagaa 


cgegcttgag 


25276 


tctcacctca 


tcgacgcctt caccgcccgg 


cctctcctgg tagaaaccga 


aegegggate 


25336 


actaccatca 


ccctgttctg catctgcccc 


acgcccggat tac atg aag ate tgt 


25391 



Met Lys lie Cys 
1430 



gtt gtc ate ttt gcg etc agt tta ata aaa act gaa ctt ttt gcc 25436 
Val Val lie Phe Ala Leu Ser Leu lie Lys Thr Glu Leu Phe Ala 
1435 1440 1445 
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gta cct tea acg cca cgc gtt gtt tct cct tgt gaa aaa acc cca 
Val Pro Ser Thr Pro Arg Val Val Ser Pro Cys Glu Lys Thr Pro 
1450 1455 1460 



25481 



gga gtc ctt aac tta cac ata gca aaa ccc ttg tat ttt acc ata 
Gly Val Leu Asn Leu His lie Ala Lys Pro Leu Tyr Phe Thr lie 
1465 1470 1475 



25526 



gaa aaa caa eta gec ctt tea att gga aaa ggg tta aca att tct 
Glu Lys Gin Leu Ala Leu Ser lie Gly Lys Gly Leu Thr lie Ser 
1480 1485 1490 



25571 



get aca gga cag ttg gaa age aca gca age gta cag gac age get 
Ala Thr Gly Gin Leu Glu Ser Thr Ala Ser Val Gin Asp Ser Ala 
1495 1500 1505 



25616 



aca cca ccc eta cgt ggt att tec cct tta aag ctg aca gac aac 
Thr Pro Pro Leu Arg Gly lie Ser Pro Leu Lys Leu Thr Asp Asn 
1510 ~ 1515 1520 



25661 



ggt tta aca tta age tat tea gat ccc ctg cgt gtg gta ggt gac 
Gly Leu Thr Leu Ser Tyr Ser Asp Pro Leu Arg Val Val Gly Asp 
1525 " 1530 1535 



25706 



caa ctt acg ttt aat ttt act tct cca eta 
Gin Leu Thr Phe Asn Phe Thr Ser Pro Leu 
1540 1545 



cgt tac gaa aat ggc 
Arg Tyr Glu Asn Gly 
1550 



25751 



agt ctt aca ttc aac tac act tct ccc atg aca eta ata aac aac 
Ser Leu Thr Phe Asn Tyr Thr Ser Pro Met Thr Leu lie Asn Asn 
1555 " 1560 1565 



25796 



agt ctt get att aac gtc aat acc tec aaa ggc etc agt agt gac 
Ser Leu Ala lie Asn Val Asn Thr Ser Lys Gly Leu Ser Ser Asp 
1570 1575 1580 



25841 



aac ggc aca etc get gta aat gtt act cca gat ttt aga ttt aac 
Asn Gly Thr Leu Ala Val Asn Val Thr Pro Asp Phe Arg Phe Asn 
1585 1590 1595 



25886 



age tct ggt gee tta act ttt ggc ata caa agt eta tgg act ttt 
Ser Ser Gly Ala Leu Thr Phe Gly lie Gin Ser Leu Trp Thr Phe 
1600 1605 1610 



25931 



cca acc aaa act cct aac tgt acc gtg ttt acc gaa agt gac tec 
Pro Thr Lys Thr Pro Asn Cys Thr Val Phe Thr Glu Ser Asp Ser 
1615 1620 1625 



25976 



ctg ctg agt ctt tgc ttg act aaa tgc gga get cac gta ctt gga 
Leu Leu Ser Leu Cys Leu Thr Lys Cys Gly Ala His Val Leu Gly 
1630 ~ 1635 1640 



26021 



age gtg agt tta age 
Ser Val Ser Leu Ser 
1645 



gga gtg gca gga acc atg eta aaa atg acc 
Gly Val Ala Gly Thr Met Leu Lys Met Thr 
1650 1655 



26066 
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cac act tct gtt acc gtt cag ttt teg ttt gat gac agt ggt aaa 
His Thr Ser Val Thr Val Gin Phe Ser Phe Asp Asp Ser Gly Lys 
1660 1665 1670 



26111 



eta ata ttc tct cca ctt gcg aac aac act tgg ggt gtt cga caa 
Leu lie Phe Ser Pro Leu Ala Asn Asn Thr Trp Gly Val Arg Gin 
1675 1680 1685 



26156 



age gag agt ccg ttg ccc aac cca tec ttc aac get etc acg ttt 
Ser Glu Ser Pro Leu Pro Asn Pro Ser Phe Asn Ala Leu Thr Phe 
1690 1695 1700 



26201 



atg cca aac agt acc att tat tct aga gga gca agt aac gaa cct 
Met Pro Asn Ser Thr lie Tyr Ser Arg Gly Ala Ser Asn Glu Pro 
1705 1710 1715 



26246 



caa aac aat tat tat gtc cag acg tat ctt aga ggc aac gtg cga 
Gin Asn Asn Tyr Tyr Val Gin Thr Tyr Leu Arg Gly Asn Val Arg 
1720 1725 1730 



26291 



aag cca att eta eta 
Lys Pro lie Leu Leu 
1735 



act gtt acc tac aac tea gtt aat tea gga 
Thr Val Thr Tyr Asn Ser Val Asn Ser Gly 
1740 1745 



26336 



tat tec tta act ttt aaa tgg gat get gtc gee aat gaa aaa ttt 
Tyr Ser Leu Thr Phe Lys Trp Asp Ala Val Ala Asn Glu Lys Phe 
1750 1755 1760 



26381 



gee act cct aca tct teg ttt tgc tat gtt gca gag caa taa 
Ala Thr Pro Thr Ser Ser Phe Cys Tyr Val Ala Glu Gin 
1765 1770 



26423 



aaccctgtta ccccaccgtc tcgttttttt cag atg aaa cga gcg aga gtt 

Met Lys Arg Ala Arg Val 
1775 



26474 



gat gaa gac ttc aac cca gtg tac cct tat gac ccc cca tac get 
Asp Glu Asp Phe Asn Pro Val Tyr Pro Tyr Asp Pro Pro Tyr Ala 
1780 1785 1790 



26519 



ccc gtc atg ccc ttc att act ccg cct ttt acc tec teg gat ggg 
Pro Val Met Pro Phe lie Thr Pro Pro Phe Thr Ser Ser Asp Gly 
1795 1800 1805 



26564 



ttg cag gaa aaa cca ctt gga gtg tta agt tta aac tac agg gat 
Leu Gin Glu Lys Pro Leu Gly Val Leu Ser Leu Asn Tyr Arg Asp 
1810 1815 1820 



26609 



ccc att act aca caa aat ggg tct etc acg tta aaa eta gga aac 
Pro lie Thr Thr Gin Asn Gly Ser Leu Thr Leu Lys Leu Gly Asn 
1825 1830 1835 



26654 



ggc etc act eta aac aac 
Gly Leu Thr Leu Asn Asn 
1840 1845 



cag gga cag tta aca tea act get ggc 
Gin Gly Gin Leu Thr Ser Thr Ala Gly 
1850 
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gaa gtg gag cct ccg etc act aat get aac aac aaa ctt gca eta 
Glu Val Glu Pro Pro Leu Thr Asn Ala Asn Asn Lys Leu Ala Leu 
1855 1860 1865 



26744 



gec tat age gaa cca tta gca gta aaa age aac cgc eta act eta 
Ala Tyr Ser Glu Pro Leu Ala Val Lys Ser Asn Arg Leu Thr Leu 
1870 1875 1880 



26789 



tea cac ace get ccc ctt gtc ate get aat aat tct tta gcg ttg 
Ser His Thr Ala Pro Leu Val lie Ala Asn Asn Ser Leu Ala Leu 
1885 1890 1895 



26834 



caa gtt tea gag cct att ttt gta aat gac gat gac aag eta gee 
Gin Val Ser Glu Pro lie Phe Val Asn Asp Asp Asp Lys Leu Ala 
1900 1905 1910 



26879 



ctg cag aca gec gee ccc ctt gta ace aac get ggc ace ctt cgc 
Leu Gin Thr Ala Ala Pro Leu Val Thr Asn Ala Gly Thr Leu Arg 
1915 1920 1925 



26924 



tta cag age get gee cct 
Leu Gin Ser Ala Ala Pro 
1930 1935 



tta gga ttg gtt gaa aat act ctt aaa 
Leu Gly Leu Val Glu Asn Thr Leu Lys 
194 0 



26969 



ctg ctg ttt tct aaa ccc ttg tat ttg caa aat gat ttt ctt gca 
Leu Leu Phe Ser Lys Pro Leu Tyr Leu Gin Asn Asp Phe Leu Ala 
1945 1950 1955 



27014 



tta gee att gaa cgc ccc ctg get gta gca gee gca ggt act ctg 
Leu Ala lie Glu Arg Pro Leu Ala Val Ala Ala Ala Gly Thr Leu 
I960 1965 1970 



27059 



acc eta caa ctt act cct cca tta aag act aac gat gac ggg eta 
Thr Leu Gin Leu Thr Pro Pro Leu Lys Thr Asn Asp Asp Gly Leu 
1975 1980 1985 



27104 



aca eta tec aca gtc gag cca tta act gta aaa aac gga aac eta 
Thr Leu Ser Thr Val Glu Pro Leu Thr Val Lys Asn Gly Asn Leu 
1990 1995 2000 



27149 



ggc ttg caa ata teg cgc cct tta gtt gtt caa aac aac ggc ctt 
Gly Leu Gin lie Ser Arg Pro Leu Val Val Gin Asn Asn Gly Leu 
2005 2010 2015 



27194 



teg ctt get att acc ccc ccg ctg cgt ttg ttt aac age gac ccc 
Ser Leu Ala lie Thr Pro Pro Leu Arg Leu Phe Asn Ser Asp Pro 
2020 2025 2030 



27239 



gtt ctt ggt ttg ggc ttc act ttt ccc eta get gtc aca aac aac 
Val Leu Gly Leu Gly Phe Thr Phe Pro Leu Ala Val Thr Asn Asn 
2035 2040 2045 



27284 



etc etc tec tta aac atg gga gac gga gtt aaa ctt acc tat aat 
Leu Leu Ser Leu Asn Met Gly Asp Gly Val Lys Leu Thr Tyr Asn 
2050 2055 " 2060 
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aaa eta aca gec aat ttg ggt agg gat tta caa ttt gaa aac ggt 
Lys Leu Thr Ala Asn Leu Gly Arg Asp Leu Gin Phe Glu Asn Gly 
2065 2070 " 2075 



27374 



gcg att gec gta acg ctt act gec gaa tta cct ttg caa tac act 
Ala lie Ala Val Thr Leu Thr Ala Glu Leu Pro Leu Gin Tyr Thr 
2080 2085 2090 



27419 



aac aaa ctt caa ctg aat att gga get ggc ctt cgt tac aat gga 
Asn Lys Leu Gin Leu Asn lie Gly Ala Gly Leu Arg Tyr Asn Gly 
2095 2100 ' 2105 



27464 



gec age aga aaa eta gat 
Ala Ser Arg Lys Leu Asp 
2110 2115 



gta aac att aac caa aat aaa ggc tta 
Val Asn lie Asn Gin Asn Lys Gly Leu 
2120 



27509 



act tgg gac aac gat gca gtt att ccc aaa eta gga teg ggc tta 
Thr Trp Asp Asn Asp Ala Val lie Pro Lys Leu Gly Ser Gly Leu 
2125 2130 " 2135 



27554 



caa ttt gac cct aat ggc aac ate get gtt ate cct gaa ace gtg 
Gin Phe Asp Pro Asn Gly Asn He Ala Val He Pro Glu Thr Val 
2140 2145 2150 



27599 



aag ccg caa acg tta tgg acg act gca gat ccc teg cct aac tgc 
Lys Pro Gin Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys 
2155 2160 2165 



27644 



tea gtg tac cag gac ttg gat gee agg ctg tgg 
Ser Val Tyr Gin Asp Leu Asp Ala Arg Leu Trp 
2170 2175 2180 



etc get ctt gtt 
Leu Ala Leu Val 



27689 



aaa agt ggc gac atg gtg cat gga age att gee eta aaa gee eta 27734 

Lys Ser Gly Asp Met Val His Gly Ser He Ala Leu Lys Ala Leu 
2185 2190 2195 

aaa ggg acg ttg eta aat cct aca gee age tac att tec att gtg 27779 

Lys Gly Thr Leu Leu Asn Pro Thr Ala Ser Tyr He Ser He Val 
2200 2205 2210 



ata tat ttt tac age aac gga gtc agg cgt ace aac tat cca acg 
He Tyr Phe Tyr Ser Asn Gly Val Arg Arg Thr Asn Tyr Pro Thr 
2215 2220 2225 



27824 



ttt gac aac gaa ggc ace tta get aac age gee act tgg gga tac 
Phe Asp Asn Glu Gly Thr Leu Ala Asn Ser Ala Thr Trp Gly Tyr 
2230 2235 2240 



27869 



cga cag ggg caa tct get aac act aat gtg ace aat gee act gaa 
Arg Gin Gly Gin Ser Ala Asn Thr Asn Val Thr Asn Ala Thr Glu 
2245 2250 2255 



27914 



ttt atg ccc age tea age 
Phe Met Pro Ser Ser Ser 
2260 2265 



agg tac ccc gtg aat aaa gga gac aac 
Arg Tyr Pro Val Asn Lys Gly Asp Asn 
2270 
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att caa aat caa tct ttt tea tac acc tgt att aaa gga gat ttt 28004 
lie Gin Asn Gin Ser Phe Ser Tyr Thr Cys He Lys Gly Asp Phe 
2275 2280 2285 

get atg cct gtc ccg ttc cgt gta aca tat aat cac gec ctg gaa 28049 
Ala Met Pro Val Pro Phe Arg Val Thr Tyr Asn His Ala Leu Glu 
2290 2295 2300 

ggg tat tec ctt aag ttc acc tgg cgc gtt gta gec aat cag gec 28094 
Gly Tyr Ser Leu Lys Phe Thr Trp Arg Val Val Ala Asn Gin Ala 
2305 2310 2315 

ttt gat att cct tgc tgt tea ttt tea tac ate aca gaa taa 28136 
Phe Asp He Pro Cys Cys Ser Phe Ser Tyr He Thr Glu 
2320 2325 2330 



aaaaccactt 


tttcatttta 


atttcttttt 


attttacacg 


aacagtgaga 


cttcctccac 


28196 


ccttccattt 


gacagcatac 


accagcctct 


cccccttcat 


agcagtaaac tgttgtgaat 


28256 


cacftccacrta 


tttgggagtt 


aaaatccaaa 


cagtctcttt 


ggtgatgaaa 


cgtcgatcag 


28316 


taa fcer era cac 


aaatccetaa 


gacaggtttt 


ccaacgtttc 


ggtgaaaaac 


tgcacaccgc 


28376 


cc tacaaaac 


aaacaaattc 


aggctctcca 


egggttatet 


ccccgatcaa 


actcagacag 


28436 


aat aaa acta 


caataatatt 

w 33 w 3 3^ w 3 w w 


ccactaaacc 


acgcaggtgg 


cgctgtctga 


acctctcggt 


28496 


gcgactcctg 


taaagctggt 


aagaagttag 


attgtccagt 


agcctcacag 


catgtatcat 


28556 


cagtctacga 


gtgcgtctgg 

3 3 J 3 3 


cgcagcagcg 


catctgaatc 


tcactgagat 


teeggcaaga 


28616 


atcgcacacc 


atcacaatca 


ggttgttcat 


gatcccatag 


ctgaacacgc 


tccagccaaa 


28676 


gctcattcgc 


tccaacagcg 


ccaccgcgtg tccgtccaac 


cttactttaa 


cataaatcag 


28736 


gtgtctgccg 


cgtacaaaca 


tgctacccac atacagaact 


tcccggggca 


ggcccctgtt 


<£. O / 3? D 


caccacctgt 


ctgtaccagg 


gaaacctcac 


atttatcagg 


gagecataga 


tggccatttt 


28856 


aaaccaatta 


gctaataccg 


ccccaccagc tctacactga 


agagaacegg gagagttaca 


28916 


atgacagtga 


ataatccatc 


tctcataacc 


cctgatggtc 


tgatgaaaat 


ctagatctaa 


28976 


cgtggcacaa 


caaatacaca 


ctttcatata 


cattttcata 


acatgttttt 


cccaggccgt 


29036 


taaaatacaa 


tcccaataca 


cgggccactc 


ctgeagtaca 


ataaagctaa 


tacaagatgg 


29096 


tatactcctc 


acctcactga 


cactgtgcat gttcatattt 


tcacattcta 


agtaccgaga 


29156 


gttctcctct 


acagcagcac 


tgctgcggtc ctcacaaggt 


ggtagctggt gatgattgta 


29216 


gggggccagt 


ctgeagegat 


accgtctgtc gcgttgcatc 


gtagaccagg 


aaccgacgca 


29276 


cctcctcgta 


cttgtggtag 


cagaaccacg 


tccgctgcca 


gcacgtctcc 


acgtaacgcc 


293[36 


ggtccctgcg 


tcgctcacgc 


tccctcctca atgcaaagtg 


caaccactct 


tgtaatccac 


29396 
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acagatccct 


ctcggcctcc 


ggggtgatgc 


acacctcaaa 


cctacagatg 


tctcggtaca 


29456 


gttccaaaca 


cgtagtgagg 


gcgagttcca 


accaagacag 


acagcctgat 


ctatcccgac 


29516 


acactggagg 


tggaggaaga 


cacggaagag 


gcatgttatt 


ccaagcgatt 


caccaacggg 


29576 


tcgaaatgaa 


gatcccgaag 


atgacaacgg 


tcgcctccgg 


agccctgatg 


gaatttaaca 


29636 


gccagatcaa 


acgttatgcg 


attctccaag 


ctatcgatcg 


ccgcttccaa 


aagagcctgg 


29696 


acccgcactt 


ccacaaacac 


cagcaaagca 


aaagcactat 


tatcaaactc 


ttcaatcatc 


29756 


aagctgcagg 


actgtacaat 


gcctaagtaa 


ttttcgtttc 


tccactcgcg 


aatgatgtcg 


29816 


cggcagatag 


tctgaaggtt 


catcccgtgc 


agggtaaaaa 


gctccgaaag 


ggcgccctct 


29876 


acagccatgc 


gtagacacac 


catcatgact 


gcaagatatc 


gggctcctga 


gacacctgca 


29936 


gcagatttaa 


cagatcaagg 


tcaggttgct 


ctccgcgatc 


acgaatctcc 


atccgcaagg 


29996 


tcatttgcaa 


aaaattaaat 


aaatctatgc 


cgactagatc 


tgtcaactcc 


gcattaggaa 


30056 


ccaaatcagg 


tgtggctacg 


cagcacaaaa 


gttccaggga 


tggtgccaaa 


ctcactagaa 


30116 


ccgctcccga 


gtaacaaaac 


tgatgaatgg 


gagtaacaca 


gtgtaaaatg 


tgcaaccaaa 


30176 


aatcactaag 


gtgctccttt 


aaaaagtcca 


gtacttctat 


attcagtccg 


tgcaagtact 


30236 


gaagcaactg 


tgcgggaata 


tgcacaacaa 


aaaaaatagg 


gcggctcaga 


tacatgttga 


30296 


cctaaaataa 


aaagaatcat 


taaactaaag 


aagcttggcg 


aacggtggga 


taaatgacac 


30356 


gctccagcag 


cagacaggca 


accggctgtc 


cccgggaacc 


gcggtaaaat 


tcatccgaat 


30416 


gattaaaaag 


aacaacagaa 


acttcccacc 


atgtactcgg 


ttggatctcc 


tgagcacaca 


30476 


gcaatacccc 


cctcacattc 


atgtccgcca 


cagaaaaaaa 


acgtcccaga 


tacccagcgg 


30536 


ggatatccaa 


cgacagctgc 


aaagacagca 


aaacaatccc 


tctgggagcg 


atcacaaaat 


30596 


cctccggtga 


aaaaagcaca 


tacatattag 


aataaccctg 


ttgctggggc 


aaaaaggccc 


30656 


ggcgtcccag 


caaatgcaca 


taaatatgtt 


catcagccat 


tgccccgtct 


taccgcgtaa 


30716 


tcagccacga 


aaaaatcgag 


ctaaaattca 


cccaacagcc 


tatagctata 


tatacactcc 


30776 


gcccaatgac 


gctaataccg 


caccacccac 


gaccaaagtt 


cacccacacc 


cacaaaaccc 


30836 


gcgaaaatcc 


agcgccgtca 


gcacttccgc 


aatttcagtc 


tcacaacgtc 


acttccgcgc 


30896 


gccttttcac 


attcccacac 


acacccgcgc 


ccttcgcccc 


gccctcgcgc 


caccccgcgt 


30956 


caccgcacgt 


caccccggcc 


ccgcctcgct 


cctccccgct 


cattatcata 


ttggcacgtt 


31016 


tccagaataa 


ggtatattat 


tgatgatg 
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<210> 30 

<211> 505 

<212> PRT 

<213> simian adenovirus SV-25 

<400> 30 

Met Arg Arg Ala Val Arg Val Thr 
1 5 

Pro Ser Tyr Glu Ser Val Met Gly 
20 

Glu Ala Pro Tyr Val Pro Pro Arg 
35 40 

Asn Ser lie Arg Tyr Ser Glu Leu 
50 55 

Val Tyr Leu Val Asp Asn Lys Ser 
65 70 

Gin Asn Asp His Ser Asn Phe Leu 
85 

Phe Thr Pro Thr Glu Ala Gly Thr 
100 

Ser Arg Trp Gly Gly Gin Leu Lys 
115 120 

Asn lie Asn Glu 
130 

Val Glu Lys Ser Asn Ala Glu Thr 
145 150 

Glu Phe Thr lie Pro Glu Gly Asn 
165 

Leu Met Asn Asn Ala lie Val Asp 
180 

Asn Gly Val Leu Glu Ser Asp lie 
195 200 

Phe Arg Leu Gly Trp Asp Pro Val 
210 215 

Tyr Thr Asn Glu Ala Phe His Pro 
225 230 

Gly Val Asp Phe Thr Gin Ser Arg 
245 



Pro Ala Ala Tyr Glu Gly Pro Pro 
10 15 

Ser Ala Asn Val Pro Ala Thr Leu 
25 30 

Tyr Leu Gly Pro Thr Glu Gly Arg 
45 

Ala Pro Leu Tyr Asp Thr Thr Lys 
60 

Ala Asp lie Ala Ser Leu Asn Tyr 
75 80 

Thr Thr Val Val Gin Asn Asn Asp 
90 95 

Gin Thr lie Asn Phe Asp Glu Arg 
105 110 

Thr lie Leu His Thr Asn Met Pro 
125 



Arg Gin Pro Arg Tyr Glu Trp Phe 
155 160 

Tyr Ser Glu Thr Met Thr lie Asp 
170 175 

Asn Tyr Leu Gin Val Gly Arg Gin 
185 190 

Gly Val Lys Phe Asp Thr Arg Asn 
205 

Thr Lys Leu Val Met Pro Gly Val 
220 

Asp lie Val Leu Leu Pro Gly Cys 
235 240 

Leu Ser Asn Leu Leu Gly lie Arg 
250 255 



Phe Met Ser Thr Asn Lys Phe Arg Ala Lys Leu Met 
135 140 
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Lys Arg Arg Pro Phe Gin Glu Gly Phe Gin lie Met Tyr Glu Asp Leu 
260 265 270 

Glu Gly Gly Asn lie Pro Ala Leu Leu Asp Val Ser Lys Tyr Glu Ala 
275 280 285 

Ser lie Gin Arg Ala Lys Ala Glu Gly Arg Glu He Arg Gly Asp Thr 
290 295 300 

Phe Ala Val Ala Pro Gin Asp Leu Glu He Val Pro Leu Thr Lys Asp 
305 310 315 320 

Ser Lys Asp Arg Ser Tyr Asn He He Asn Asn Thr Thr Asp Thr Leu 
325 330 335 

Tyr Arg Ser Trp Phe Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly 
340 345 350 

Val Arg Ser Trp Thr He Leu Thr Thr Thr Asp Val Thr Cys Gly Ser 
355 ~ 360 365 

Gin Gin Val Tyr Trp Ser Leu Pro Asp Met Met Gin Asp Pro Val Thr 
370 375 380 

Phe Arg Pro Ser Thr Gin Val Ser Asn Phe Pro Val Val Gly Thr Glu 
385 390 395 400 

Leu Leu Pro Val His Ala Lys Ser Phe Tyr Asn Glu Gin Ala Val Tyr 
405 410 415 

Ser Gin Leu He Arg Gin Ser Thr Ala Leu Thr His Val Phe Asn Arg 
420 425 430 

Phe Pro Glu Asn Gin He Leu Val Arg Pro Pro Ala Pro Thr He Thr 
435 440 445 

Thr Val Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro 
450 455 460 

Leu Arg Ser Ser He Ser Gly Val Gin Arg Val Thr He Thr Asp Ala 
465 470 475 480 

Arg Arg Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Val Val Ala 
485 490 495 

Pro Lys Val Leu Ser Ser Arg Thr Phe 
500 505 

<210> 31 
<211> 921 
<212> PRT 

<213> simian adenovirus SV-25 
<400> 31 

Met Ala Thr Pro Ser Met Met Pro Gin Trp Ser Tyr Met His He Ala 
1 5 10 15 
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Gly Gin Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gin Phe Ala 
20 25 30 

Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro 
35 40 45 

Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gin Arg Leu 
50 55 60 

Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Thr Ala Tyr Ser Tyr 
65 70 75 80 

Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met 
85 90 95 

Ala Ser Thr Tyr Phe Asp lie Arg Gly Val Leu Asp Arg Gly Pro Ser 
100 105 110 

Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly 
115 " 120 125 

Ala Pro Asn Pro Ser Glu Trp Thr Asp Thr Ser Asp Asn Lys Leu Lys 
130 135 140 

Ala Tyr Ala Gin Ala Pro Tyr Gin Ser Gin Gly Leu Thr Lys Asp Gly 
145 150 155 160 

He Gin Val Gly Leu Val Val Thr Glu Ser Gly Gin Thr Pro Gin Tyr 
165 170 175 

Ala Asn Lys Val Tyr Gin Pro Glu Pro Gin He Gly Glu Asn Gin Trp 
180 185 190 

Asn Leu Glu Gin Glu Asp Lys Ala Ala Gly Arg Val Leu Lys Lys Asp 
195 200 205 

Thr Pro Met Phe Pro Cys Tyr Gly Ser Tyr Ala Arg Pro Thr Asn Glu 
210 " 215 ~ 220 

Gin Gly Gly Gin Ala Lys Asn Gin Glu Val Asp Leu Gin Phe Phe Ala 
225 230 235 240 

Thr Pro Gly Asp Thr Gin Asn Thr Ala Lys Val Val Leu Tyr Ala Glu 
245 250 255 

Asn Val Asn Leu Glu Thr Pro Asp Thr His Leu Val Phe Lys Pro Asp 
260 265 270 

Asp Asp Ser Thr Ser Ser Lys Leu Leu Leu Gly Gin Gin Ala Ala Pro 
275 280 285 

Asn Arg Pro Asn Tyr He Gly Phe Arg Asp Asn Phe He Gly Leu Met 
290 295 300 
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Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gin Ala Ser 
305 ~ 310 315 320 

Gin Leu Asn Ala Val Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser 
325 330 335 

Tyr Gin Leu Met Leu Asp Ala Leu Gly Asp Arg Ser Arg Tyr Phe Ser 
340 345 350 

Met Trp Asn Gin Ala Val Asp Ser Tyr Asp Pro Asp Val Arg lie lie 
355 360 365 

Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu 
370 375 380 

Gly Gly Met Val Val Thr Asp Asn Tyr Asn Ser Val Thr Pro Gin Asn 
385 390 395 400 

Gly Gly Ser Gly Asn Thr Trp Gin Ala Asp Asn Thr Thr Phe Ser Gin 
405 410 415 

Arg Gly Ala Gin lie Gly Ser Gly Asn Met Phe Ala Leu Glu lie Asn 
420 425 430 

Leu Gin Ala Asn Leu Trp Arg Gly Phe Leu Tyr Ser Asn lie Gly Leu 
435 440 445 

Tyr Leu Pro Asp Ser Leu Lys lie Thr Pro Asp Asn lie Thr Leu Pro 
450 455 460 

Glu Asn Lys Asn Thr Tyr Gin Tyr Met Asn Gly Arg Val Thr Pro Pro 
465 470 475 480 

Gly Leu lie Asp Thr Tyr Val Asn Val Gly Ala Arg Trp Ser Pro Asp 
485 " 490 495 

Val Met Asp Ser lie Asn Pro Phe Asn His His Arg Asn Ala Gly Leu 
500 505 510 

Arg Tyr Arg Ser Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His 
515 520 525 

lie Gin Val Pro Gin Lys Phe Phe Ala lie Lys Asn Leu Leu Leu Leu 
530 535 540 

Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met 
545 550 555 560 

lie Leu Gin Ser Ser Leu Gly Asn Asp Leu Arg Val Asp Gly Ala Ser 
565 570 575 

lie Arg Phe Asp Ser lie Asn Leu Tyr Ala Asn Phe Phe Pro Met Ala 
580 585 590 

His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn 
595 600 605 
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Asp Gin Ser Phe Asn Asp Tyr Leu Cys Ala Ala Asn Met Leu. Tyr Pro 
610 615 620 

lie Pro Ala Asn Ala Thr Ser Val Pro lie Ser lie Pro Ser Arg Asn 
625 630 635 640 

Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg Leu Lys Thr Lys Glu 
645 650 655 

Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly 
660 665 670 

Ser lie Pro Tyr Leu Asp Gly Thr Phe Tyr Leu Asn His Thr Phe Lys 
675 680 685 

Lys Val Ser lie Met Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp 
690 695 700 

Arg Leu Leu Thr Pro Asn Glu Phe Glu He Lys Arg Ser Val Asp Gly 
705 710 715 720 

Glu Gly Tyr Asn Val Ala Gin Ser Asn Met Thr Lys Asp Trp Phe Leu 
725 730 735 

He Gin Met Leu Ser His Tyr Asn He Gly Tyr Gin Gly Phe Tyr Val 
740 745 750 



Met Tyr Ser Phe Phe Arg Asn Phe Gin 



Pro Glu Asn Tyr Lys Asp Arg 

755 ^ " 760 

Pro Met Ser Arg Gin Val Val Asp 
770 775 

Asp Val Lys Leu Pro Tyr Gin His 
785 790 

Met Gly Pro Thr Met Arg Glu Gly 
805 

Tyr Pro Leu He Gly Glu Thr Ala 
820 

Phe Leu Cys Asp Arg Val Met Trp 
835 " 840 

Met Ser Met Gly Ser Leu Thr Asp 
850 855 

Asn Ser Ala His Ala Leu Asp Met 
865 870 

Glu Pro Thr Leu Leu Tyr Val Leu 
885 

He His Gin Pro His Arg Gly Val 
900 



765 

Thr Val Thr Tyr Thr Asp Tyr Lys 
780 

Asn Asn Ser Gly Phe Val Gly Tyr 
795 800 

Gin Ala Tyr Pro Ala Asn Tyr Pro 
810 815 

Val Pro Ser Leu Thr Gin Lys Lys 
825 830 

Arg He Pro Phe Ser Ser Asn Phe 
845 

Leu Gly Gin Asn Met Leu Tyr Ala 
860 

Thr Phe Glu Val Asp Pro Met Asp 
875 " 880 

Phe Glu Val Phe Asp Val Val Arg 
890 895 

He Glu Ala Val Tyr Leu Arg Thr 
905 910 
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Pro Phe Ser Ala Gly Asn Ala Thr Thr 
915 920 



<210> 32 
<211> 347 
<212> PRT 

<213> simian adenovirus SV-25 
<400> 32 

Met Lys lie Cys Val Val lie Phe Ala Leu Ser Leu lie Lys Thr Glu 
15 10 15 

Leu Phe Ala Val Pro Ser Thr Pro Arg Val Val Ser Pro Cys Glu Lys 
20 25 30 

Thr Pro Gly Val Leu Asn Leu His lie Ala Lys Pro Leu Tyr Phe Thr 
35 40 45 

lie Glu Lys Gin Leu Ala Leu Ser lie Gly Lys Gly Leu Thr lie Ser 
50 " 55 60 

Ala Thr Gly Gin Leu Glu Ser Thr Ala Ser Val Gin Asp Ser Ala Thr 
65 - 70 75 80 

Pro Pro Leu Arg Gly lie Ser Pro Leu Lys Leu Thr Asp Asn Gly Leu 
85 90 95 

Thr Leu Ser Tyr Ser Asp Pro Leu Arg Val Val Gly Asp Gin Leu Thr 
100 105 110 

Phe Asn Phe Thr Ser Pro Leu Arg Tyr Glu Asn Gly Ser Leu Thr Phe 
115 120 125 

Asn Tyr Thr Ser Pro Met Thr Leu lie Asn Asn Ser Leu Ala lie Asn 
130 135 140 

Val Asn Thr Ser Lys Gly Leu Ser Ser Asp Asn Gly Thr Leu Ala Val 
145 ~ 150 155 160 

Asn Val Thr Pro Asp Phe Arg Phe Asn Ser Ser Gly Ala Leu Thr Phe 
165 170 175 

Gly lie Gin Ser Leu Trp Thr Phe Pro Thr Lys Thr Pro Asn Cys Thr 
180 185 190 

Val Phe Thr Glu Ser Asp Ser Leu Leu Ser Leu Cys Leu Thr Lys Cys 
195 200 205 

Gly Ala His Val Leu Gly Ser Val Ser Leu Ser Gly Val Ala Gly Thr 
210 215 220 

Met Leu Lys Met Thr His Thr Ser Val Thr Val Gin Phe Ser Phe Asp 
225 230 235 240 
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Asp Ser Gly Lys Leu lie Phe Ser 
245 

Val Arg Gin Ser Glu Ser Pro Leu 
260 

Thr Phe Met Pro Asn Ser Thr lie 
275 280 

Pro Gin Asn Asn Tyr Tyr Val Gin 
290 295 

Lys Pro lie Leu Leu Thr Val Thr 
305 310 

Ser Leu Thr Phe Lys Trp Asp Ala 
325 

Pro Thr Ser Ser Phe Cys Tyr Val 
340 



Pro Leu Ala Asn Asn Thr Trp Gly 
250 255 

Pro Asn Pro Ser Phe Asn Ala Leu 
265 270 

Tyr Ser Arg Gly Ala Ser Asn Glu 
285 

Thr Tyr Leu Arg Gly Asn Val Arg 
300 

Tyr Asn Ser Val Asn Ser Gly Tyr 
315 320 

Val Ala Asn Glu Lys Phe Ma Thr 
330 335 

Ala Glu Gin 
345 



<210> 33 
<211> 559 
<212> PRT 

<213> simian adenovirus SV-25 
<400> 33 

Met Lys Arg Ala Arg Val Asp Glu Asp Phe Asn Pro Val Tyr Pro Tyr 
15 10 15 

Asp Pro Pro Tyr Ala Pro Val Met Pro Phe lie Thr Pro Pro Phe Thr 
20 25 30 

Ser Ser Asp Gly Leu Gin Glu Lys Pro Leu Gly Val Leu Ser Leu Asn 
35 40 45 

Tyr Arg Asp Pro lie Thr Thr Gin Asn Gly Ser Leu Thr Leu Lys Leu 
50 55 60 

Gly Asn Gly Leu Thr Leu Asn Asn Gin Gly Gin Leu Thr Ser Thr Ala 
65 70 75 80 

Gly Glu Val Glu Pro Pro Leu Thr Asn Ala Asn Asn Lys Leu Ala Leu 
85 90 95 

Ala Tyr Ser Glu Pro Leu Ala Val Lys Ser Asn Arg Leu Thr Leu Ser 
100 105 110 

His Thr Ala Pro Leu Val lie Ala Asn Asn Ser Leu Ala Leu Gin Val 
115 120 125 

Ser Glu Pro lie Phe Val Asn Asp Asp Asp Lys Leu Ala Leu Gin Thr 
130 135 140 
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Ala Ala Pro Leu Val Thr Asn Ala Gly Thr Leu Arg Leu Gin Ser Ala 
145 150 155 160 

Ala Pro Leu Gly Leu Val Glu Asn Thr Leu Lys Leu Leu Phe Ser Lys 
165 170 175 

Pro Leu Tyr Leu Gin Asn Asp Phe Leu Ala Leu Ala lie Glu Arg Pro 
180 185 190 

Leu Ala Val Ala Ala Ala Gly Thr Leu Thr Leu Gin Leu Thr Pro Pro 
195 200 205 

Leu Lys Thr Asn Asp Asp Gly Leu Thr Leu Ser Thr Val Glu Pro Leu 
210 215 220 

Thr Val Lys Asn Gly Asn Leu Gly Leu Gin lie Ser Arg Pro Leu Val 
225 " 230 235 240 

Val Gin Asn Asn Gly Leu Ser Leu Ala lie Thr Pro Pro Leu Arg Leu 
245 250 255 

Phe Asn Ser Asp Pro Val Leu Gly Leu Gly Phe Thr Phe Pro Leu Ala 
"260 265 270 

Val Thr Asn Asn Leu Leu Ser Leu Asn Met Gly Asp Gly Val Lys Leu 
275 280 285 

Thr Tyr Asn Lys Leu Thr Ala Asn Leu Gly Arg Asp Leu Gin Phe Glu 
290 295 300 

Asn Gly Ala lie Ala Val Thr Leu Thr Ala Glu Leu Pro Leu Gin Tyr 
305 310 315 320 

Thr Asn Lys Leu Gin Leu Asn lie Gly Ala Gly Leu Arg Tyr Asn Gly 
325 330 335 

Ala Ser Arg Lys Leu Asp Val Asn lie Asn Gin Asn Lys Gly Leu Thr 
340 345 350 

Trp Asp Asn Asp Ala Val lie Pro Lys Leu Gly Ser Gly Leu Gin Phe 
355 360 365 

Asp Pro Asn Gly Asn lie Ala Val lie Pro Glu Thr Val Lys Pro Gin 
370 375 380 

Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys Ser Val Tyr Gin 
385 390 395 400 

Asp Leu Asp Ala Arg Leu Trp Leu Ala Leu Val Lys Ser Gly Asp Met 
405 410 415 

Val His Gly Ser lie Ala Leu Lys Ala Leu Lys Gly Thr Leu Leu Asn 
420 425 430 

Pro Thr Ala Ser Tyr He Ser He Val He Tyr Phe Tyr Ser Asn Gly 
435 440 445 
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Val Arg Arg Thr Asn Tyr Pro Thr 
450 455 

Asn Ser Ala Thr Trp Gly Tyr Arg 
465 470 

Val Thr Asn Ala Thr Glu Phe Met 
485 

Asn Lys Gly Asp Asn lie Gin Asn 
500 

Lys Gly Asp Phe Ala Met Pro Val 
515 520 

Ala Leu Glu Gly Tyr Ser Leu Lys 
530 535 

Gin Ala Phe Asp lie Pro Cys Cys 
545 550 



Phe Asp Asn Glu Gly Thr Leu Ala 
460 

Gin Gly Gin Ser Ala Asn Thr Asn 
475 480 

Pro Ser Ser Ser Arg Tyr Pro Val 
490 495 

Gin Ser Phe Ser Tyr Thr Cys lie 
505 " 510 

Pro Phe Arg Val Thr Tyr Asn His 
525 

Phe Thr Trp Arg Val Val Ala Asn 
540 

Ser Phe Ser Tyr lie Thr Glu 
555 



<210> 34 

<211> 34115 

<212> DNA 

<213> simian adenovirus SV-39 
<220> 

<221> CDS 

<222> (13448) . . (14959) 

<223> L2 Penton 

<220> 

<221> CDS 

<222> (17785) . . (20538) 

<223> L3 Hexon 

<220> 

<221> CDS 

<222> (29515) . . (31116) 

<223> L5 Fiber #1 



<400> 34 



catcatcaat 


ataacaccgc 


aagatggcga 


ccgagttaac 


atgcaaatga ggtgggcgga 


60 


gttacgcgac 


ctttgtcttg 


ggaacgcgga 


agtgggcgcg 


gcgggtttcg gggaggagcg 


120 


cggggcgggg 


cgggcgtgtc 


gcgcggcggt 


gacgcgccgg 


ggacccggaa attgagtagt 


180 


ttttattcat tttgcaagtt 


tttctgtaca 


ttttggcgcg 


aaaactgaaa cgaggaagtg 


240 


aaaagtgaaa 


aatgccgagg 


tagtcaccgg gtggagatct 


gacctttgcc gtgtggagtt 


300 


tacccgctga 


cgtgtgggtt 


tcggtctcta 


ttttttcact 


gtggttttcc gggtacggtc 


360 
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aaa rrrr +* rrrp 
dddy y LL lll 


a t" t i~ i" a t era c 


tccaccr tcacr 

W'wW't-* WW?V*4.^ 


ctgategcta 


aaatatt taa 


tgcgcctcag 


420 


d l l y Loaay a 


fiffccactctt 

y y ^ V^Cl V« I* U U 


aa.aiiac.caac 


gagaagagtt 


t tctcctccg 


cgttccgcca 


480 


a r«i~ ret - rra a a a 
d l L y Ly aaaa 


Go. uy ay y qciu 


ttcttactat 


cticcaaaact. 


crccaCTceracc 


Qtacrccercccf 

\^ wuy *w»w»u w^w>w 


540 


agCLy l ty y a. 


rrrra r* a t~ 4- rr t~ +" 
yyaua tuy l l 


a cccrcra rrr'i - r* 

auuyyay llo 


trrr/rra rra cna 

vu y y ci y ci Ly a 


tcct caercrtcr 

k> wuu>ay y y 


atttctcaet 

Vp* V- U I— w u uuv l» 


600 


ULLy Lgddya 


ff-f fa rrt r*t t 

LtULdy LULL 


r*a1" rra f* r*fr rt" 

L>0 L y CL uLLUL 


afrrafattrra 

a Ly a La l» y a 


t c c er cr er t cr 1 1 

L^ L* Ly U U UU WW 


ct a cr cr crcr ea a cr 

y ci y y y y u>aay 


660 


ay y atg ao uy 


rr r» 4- nrra rrt r*t~ 
y l Lyy ay lll 


rrt" nets tff cirri" 
y Lyyauyyy l 


t'Ttttccrrrra 

l> w v» i* i<» l Ly y a 


c cr c t a t cr c t ct 


c t a CT a CT Cf C t CT 

W C4 y C4- \A \** W ^ 


720 


a"}-4-4-rrr*r*a r"r* 
d l l L-La^u 


Cl l L LL>aL>QQ^ 


tctcacactcr 


arjccccracrtc 


agetgeta t t 


cctgaattgt 


780 


r*a t* f* a rf eft* rra 

La L > — a y y l y a 


a rttrracttcr 

aLLLyaLUUy 


gefctg ttacg 


agactatgee 


t c c cr cr a cr t c cr 


aataaaaacra 

yuuyu.yyu.yy 


840 


ar*arrr , rrrrrrat 
duay uyyyaL 


r*a rrrrra t rrc 
Lay Lya lll l 


accrcrctttta 

a Lyy u l <• c> cv 


t cf a tctctaa 


ggega t tget 


atactaaaag 


900 


is !a rr » 4~ rc 4~ 

aayatydL^d 


+*rrrrr'rfatrfat" 

uyy LyaLydL 


rrrr a 1~ +~ +■ r*rra r* 
y y d l l LLy aL 


4- rrrra r*rrp"t" rr 
Ly y a Ly l l»ll 


cr cr c er rr t er c c er 

y y Ly y i~ y w Ly 


crCTCTacraCTact 

y y y u y u y u u> u 


960 


r*^r a a rr+~ r»r*^" rt 

ytoagLCCty 


frfaa^arra r* 
Ly da LdLLa L 


rrt tt ft a +~ rrf +" a 

7 39 7 Ld 


r* r rrrra rra ccr* 
LLyyayaLLL 


ert ct a totter 

y w w uy uuu 


tattctctat 

W %J W W W W W 


1020 


guLdicuccy 


LLL Lda L>y L L 


y L L L L L y LLL 


a r*a rrt~ a a rrt" rr 
dLdy Laay Ly 


ttttatcrctt 

l« i>> w u>y u y v»» w w 


ttttaccctcr 

w w w w ti w> wv w V4 


1080 




ydy LLLaLLL 


LL LLLLy Ly L 


rhrat*a rrrr rrt" 
L LLa Lay y y i> 


crttcrtttatt 

y u uu u u u a w u 


atacrcrtcctrf 

u vuuyj w w w- w 


1140 


+-4-4- r~> -a rr a 4- rr+* 
lll Ldy auy l 


rr rra rrrraarr't' 
y y ay y aaLLL 


rra fa rr +- a r*i"a 
yd Lay lclllci 


ct* rrrra a a t era 

Lyy uu a Ly a 


CTCTacrcraaaacr 

U U C4 \J w& W U U 


ccctccccgc 


1200 


a a a r*t" a a r» 
uyaaaLLaai< 


r*a rrr*rr r t frr* 
l Lay Ly LLy l 


a rra rri*aai"a 
ayaLLLaa l a 


t - t"t" t rra era c:c: 

l» L» v» *^M ^ M L w 


ctccfcr cccacr 


cgtgtgtcat 


1260 


r* tr* rr rr a a r*rr 
LLLyyaaaLy 


irrci" fret rrt t 
Ly LLy LLy l l 


aattcrcatacf 

o ct w «— y w u. w cl y 


aacrattta tt 

Ci V* y Www (4 w w 


ggaagagece 


actgaacctt 


1320 


t" rrrra r*t t rft r* 


rthaa a cr err a 

U U CI U y Ly u 


cccccrccccrc 


aatacroaccrc 

w4 N4 W y V/ \^ 


ggtgccagt fc 


ttttctctct 


1380 


arrr , +~t~r*r , rrrrrr 
ay uULULyy y 


4- rra r* t r* a rr 1" rr 
LyaLLLay Ly 


raai*aaaaai" 
Ldd Laaaaa l 


t"t"t* cttercrcia 

i>wv»Li*i»yy 


acacrrrtcrtat 


gtgtttactt 


1440 


Uauy y y t-y y y 


day y y a l Lay 


rr rrrr a rr1~ af aa 
yyyay l a l a a 


a rrr*t" rrrra rrrrrr 
ayLLyyayyy 


rraaaaatcta 

y cici ci uu u w uy 


aggctgtcag 


1500 


2 4- /-"^f a rr 4~ rfa rr 
aLCydytyay 


a a rrt t <-*r*a4rrr 
day L LLLa Ly 


na r*+*i~rr+"a r* rr 
y a l l l y l a l y 


a rra rrr'r't"a era 
ayayLLLaya 


era a tctaacrt 

y ci ci w ^ uuu y u 


tetttgegae 


1560 


rrt" t+~rrr , trrrra 


rfrra rrrr r* <*"• t rr 
yyciyy l l l l l 


rf a ranaacrf" 
yaLayaaLL l 


cttacatttcr 


gaggtttctg 


ttcggttccc 


1620 


r*t r*t rra rrt r*rr 
ULL-uyay l l y 


ctttttrrca c 

U> U U U U y UrCl "w 


c a cj a t ct a a a c 

wy MM l y ci a y w 


cracracrcacct 

y l4. \+a ka y v> l^ w 


gaeggaattt 


aatqaacttt 

yuuyyuww.ww. 


1680 


+- = rra cr c a rf p t 
l a yoyuay 


ercctcrcr a ctcr 

y u- u» u y y a w w y 


tttaattctt 

L' L^ W L» W O 


tgaatctegg 


ccaccggacg 


ctgetagagg 


1740 


a rra rrrr r 1 1 1 1 


tccacaatto 

w cl u»u ci i— »y 


rra cttttcct 

y ci w w i« l» Lm v-» w 


ctccaCTcrccer 


tctgtgttca 


acCTcttactt 


1800 


t"trrr't~rrta fa 

toy LLy UaUa 


t ctrrttrrrra r* 

LLLy uuyyaLr 


a era t* orcraaccr 

a y a l y y a a Ly 


aacacracCTca 


gctcagcccg 


ggttacactc 


1860 


frrrra r* 1 1" r*4~ 
tyydutLLuu 


rra i-* rr r* t a +~ rr r» 
y«Ly l La ty l 


r*i~ a t - rrrra a rrt" 
LLa Lyyaay l 


t" r*rrrra a tcacr 

Ly y ci a i* va*ct y 


cr a cr cr cr cr cr a cr cr 


aagctg tacg 


1920 


ggcgcttggt 


ggagaggcat 


ccgtctctgc 


gccagcagcg 


tetgeaaget 


caagtgctgc 


1980 


tgaggcggga 


ggatctggaa 


gecatttegg 


aggaggagag 


cggcatggaa 


gagaagaatc 


2040 


cgagagcggg 


gctggaccct 


ceggeggagg 


agtagggggg 


ataccggacc 


cttttcctga 


2100 
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gttggctttg ggggcggtgg ggggcgcttc tgtggtacgt gaggatgaag aggggcgcca 2160 

acgcggtcag aagagggagc attttgagtc ctcgactttc ttggctgatg taaccgtggc 2220 

cctgatggcg aaaaacaggc tggaggtggt gtggtacccg gaagtatggg aggactttga 2280 

gaagggggac ttgcacctgc tggaaaaata taactttgag caggtgaaaa catactggat 2340 

gaacccggat gaggactggg aggtggtttt gaaccgatac ggcaaggtag ctctgcgtcc 2400 

cgactgtcgc taccaggttc gcgacaaggt ggtcctgcga cgcaacgtgt acctgttggg 2460 

caacggcgcc accgtggaga tggtggaccc cagaaggggt ggttttgtgg ccaatatgca 2520 

agaaatgtgc cctggggtgg tgggcttgtc tggggtgact tttcatagtg tgaggtttag 2580 

cggtagcaat tttgggggtg tggttattac cgcgaacact cctgtggtcc tgcataattg 2640 

ctactttttt ggcttcagca acacctgtgt ggaaatgagg gtgggaggca aagtgcgcgg 2700 

gtgttccttt tacgcttgct ggaagggggt ggtgagccag ggtaaggcta aagtgtctgt 2760 

tcacaagtgt atgttggaga gatgcacctt gggcatttcc agtgagggct tcctccacgc 2820 

cagcgacaac gtggcttctg acaacggctg cgcctttctt atcaagggag ggggtcgcat 28 80 

ctgtcacaac atgatatgcg gccctgggga tgtcccccca aagccttacc agatggttac 2940 

ctgcacagat ggcaaggtgc gcatgctcaa gcctgtgcac attgtgggcc accggcgcca 3000 

ccgctggcca gagtttgaac acaatgtgat gacccgctgt agcttgtacc tgggaggcag 3060 

gcgaggagtt ttcttgccca gacagtgtaa cctggcccac tgcaacgtga tcatggaaca 3120 

atccgccgct acccaggttt gctttggagg aatatttgat ataagcatgg tggtgtataa 3180 

gatcctgcgc tacgacgact gtcgggctcg tactcgaacc tgcgactgcg gagcctctca 3240 

cctgtgtaac ctgactgtga tggggatggt gactgaggag gtgcgactgg accactgtca 3300 

gcactcttgc ctgcgggagg agttttcttc ctcggacgag gaggactagg taggtggttg 3360 

gggcgtggcc agcgagaggg tgggctataa aggggaggtg tcggctgacg ctgtcttctg 3420 

tttttcaggt accatgagcg gatcaagcag ccagaccgcg ctgagcttcg acggggccgt 3480 

gtacagcccc tttctgacgg ggcgcttgcc tgcctgggcc ggagtgcgtc agaatgttac 3540 

cggttcgacc gtggacggac gtcccgtgga tccatctaac gctgcttcta tgcgctacgc 3600 

tactatcagc acatctactc tggacagcgc cgctgccgcc gcagccgcca cctcagccgc 3660 

tctctccgcc gccaagatca tggctattaa cccaagcctt tacagccctg tatccgtgga 3720 

cacctcagcc ctggagcttt accggcgaga tctagctcaa gtggtggacc aactcgcagc 3780 
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ccrtcraa ccaa 


caattacaac 


tcrcrtcrtcaac 
**yy **y uu r wv 


ccgagtggag 

3 3 33 3 


caactttccc 


gccctcccca 


3840 


otaacco caa 


aaattcaata 


aacagaattt 


aataaacagc 


acttgagaaa 


agtttaaact 


3900 


totoottoac 


tttattcctg 


aatacrctaaa 


aaqagggaac 

333333 


ggcgggaacg 

3 3 3 3 3 3 


gtaagacctg 


3960 


otccatccrtt 

VJ WW>W>t*WWX4 WW 


cccggtcgtt 


aaaaacacaa 

yuyuuwu.wyy 


tggatttttt 


ccaagacccg 


atagaggtgg 

3 3 3 3 3 


4020 


rrt*r*tcraaccft 


toaoatacat 


aaacataaac 


ccatctcgag 

3 *-^*-^-y y y 


ggtggaggta 

yy v 33 33 


ggcccactgc 


4080 


aoaoectcot 


tfct caaaaat 


aatattataa 
y y w y *»*»y 


atgatccagt 


cgtaggcccc 


ccgctgggcg 

3 3 3 3 3 


4140 


t cf cr t o c t a a a 


aoatatcctt 


cagcagcaag 


ctgatggcaa 


egggaagace 

^333^^3 


cttggtgtag 


4200 


otcrttoacaa 


ageggttgag 


ttaaaaaacra 
w *"yy y **y y y y 


tgcatgcggg 

3 3 3 3 3 


gactgatgag 

3 3 3 3 


gtgcattttg 


4260 


occtooatct 


toaacttcrac 

V— t-4 *H w W 


tatgttgccg 


cccagatcgc 


gectgggatt 

3 3 3 3 


catgttatgc 


4320 


aacracca cca 


ocaccoacfta 


accggtgcag 


cacrgggaatt 
w y yyyy 


tgtcgtgcag 

w 3 w ^3 v -3 w 3 


cttggaaggg 

3 3 333 


4380 


aaa Ofot CTOa 


aaaatttcrcra 


oacccctccro 

\£ t*. W W» V-*" W *w u w 


tgcccgccta 


ggttttccat 


gcactcatcc 


444Q 


a toatoa too 


ncratcfcfcfccc 


w> w» *j y « ^ 


gcctgggcaa 
y^^^yyy 


aaacgttgcg 


ggggtccgtg 

3333 3 3 


4500 


a cat cot a at 


totoot cctcr 


ggtgagttca 


tcataggaca 


ttttgacaaa 


gcgcgggcag 

3 3 333 3 


4560 


aaGfltcccaa 

b4 y \A w n— ' W* %w u y 


actggggaat 


gatggttcca 


tccggtccgg 


gggcgtagtt 

3 3 3 3 3 


gccctcgcag 


4620 


atttacattt 


cccaggcttt 


gatttcagag 


ggagggatca 
y y ^* 3 3 3 


tgtcaacctg 


gggggcgatg 


4680 


aaaaaaatcfd 


tefcetgggge 


yyyyy*'y t * v *y 


agctgggtgg 

"3333 33 


aaagcaggtt 


gegcaagage 

3 3 3 7 


4740 


tgtgacttgc 


cgcagccggt 


y y y y y 


atgacagcta 


tgacgggttg 

3 3 3 3 3 


cagggtgtag 


4800 


tttaoaoaac 

W W W *-4 U. \J d \J w» 


tacaactgcc 


atcatccttc 


aaaagcgggg 

3 v 333 3 


ccacactgtt 


taaaagttct 


4860 


eta a ca tot a 


aatttt ccca 

GL \M W W w W W> W W» U 


cactaagtcc 


tgeaggagae 


gtgaccctcc 


tagggagaga 


4920 


agtt caggaa 


gcgaagcaaa 


gtttttaagt 


ggcttgaggc 

33 wk -"3 33 


catcggccaa 


gggcaagttc 


4980 


ctgagagttt 


aactaaacaa 


ttccagccgg 


tcccagagct 


cggttacgtg 

3 3 3 3 


ctctacggca 


5040 


tctcgatcca 


gcagacctcc 


tcatttcacrcr 


ggttggggcg 


gctctggctg 

3 33 3 


tagggaatga 


5100 


a a ccrcr t crcrcf c 


atccaactaa 


accataatac 

y w y y 3 


ggtccctcca 


tgggegcagg 


gttctcttca 


5160 


aaatoatctc 

y y y y y 




aat aaatcrcrcr 

aauyyy ^yyy 


ccccgggctg 


ggcgctggcc 

33 3 3 3 


agggtgeget 


5220 


taacfcrctcrao 

W \4 ^ y W \J V4 


acaactaatcr 

y^yyL.uyy 


gcgaaccgtt 


gettttegtc 


tccctgcaag 


teagecaaat 


5280 


au uwa y y civ* 


cataaactca 
wa y y wb* 


taatccaaac 

wwi\4 u vwwwm w 


tctctgcggc 


atgtcctttg 


gegegaauct 

3 3 3 ^ 3 


5340 


tgcctttgga 


aacgtgcccg 


cagtttgagc 


agagcaagca 


ttttagcgcg 


tagagttt tg 


OflUU 


gcgccaagaa 


cacggattcc 


ggggaataag 


catccccacc 


gcagttggag 


caaacggttt 


5460 


cgcattccac 


cagccaggtc 


agctgaggat 


cttttgggtc 


aaaaaccaag 


cgcccgccgt 


5520 
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tttttttgat gcgcttccta cctcgggtct ccatgaggcg gtgcccgcgt tcggtgacga 5580 

agaggctgtc ggtgtctccg tagacggagg tcagggcgcg ctcctccagg ggggtcccgc 5640 

ggtcctcggc gtagagaaac tcgcaccact ctgacataaa cgcccgggtc caggctagga 5700 

cgaatgaggc gatgtgggaa gggtaccggt cgttatcgat gagggggtcg gttttttcca 5760 

aggtgtgcag gcacatgtcc ccctcgtccg cttccaaaaa tgtgattggc ttgtaggtgt 5820 

aagtcacgtg atcctgtcct tccgcggggg tataaaaggg ggcgtttccc ccctcctcgt 5880 

cactctcttc cggttcgctg tcgccaaagg ccagctgttg gggtacgtaa acgcgggtga 5940 

aggcgggcat gacctgtgcg ctgaggttgt cagtttctat atacgaggaa gatttgatgg 6000 

cgagcgcccc cgtggagatg cccttgaggt gctcggggcc catttggtca gaaaacacaa 6060 

tctgtcggtt atcaagcttg gtggcaaaag acccgtagag ggcgttggag agcaacttgg 6120 

cgatggagcg ctgggtttgg tttttttccc ggtcggcttt ttccttggcc gcgatgttga 6180 

gctggacgta ctccctggcc acgcacttcc agccgggaaa aacggccgtg cgctcgtccg 6240 

gcaccagcct cacgctccat ccgcggttgt gcagggtgat gacgtcgatg ctggtggcca 6300 

cctctccgcg caggggctcg ttggtccagc agaggcgacc gcccttgcga gagcagaagg 6360 

ggggcagggg gtcaagcagg cgctcgtccg gggggtcggc gtcgatggta aagatggcgg 6420 

gcagcaggtg tttgtcaaag taatcgatct gatgcccggg gcaacgcagg gcggtttccc 6480 

agtcccgcac cgccaaggcg cgctcgtatg gactgagggg ggcgccccag ggcatgggat 6540 

gcgtcagggc cgaggcgtac atgccgcaga tgtcatagac gtaaaggggc tcctccagga 6600 

cgccgaggta ggtggggtag cagcgccccc cgcggatgct ggcccgtacg tagtcgtaga 6660 

gctcgtgcga gggggccaga aggtggcggc tgaggtgagc gcgctggggc ttttcatctc 6720 

ggaagaggat ctgcctgaag atggcgtggg agttggagga gatggtgggc cgctgaaaaa 67 80 

tgttgaagcg ggcgtcgggc agacccacgg cctcgccgat aaagtgggcg taggactctt 6840 

gcagcttttc caccagggag gcggtgacca gcacgtccag agcgcagtag tccagggttt 6900 

cccgcacgat gtcataatgc tcttcctttt tttccttcca gaggtctcgg ttgaagagat 6960 

actcttcgcg gtctttccag tactcttgga gaggaaaccc gttttcgtct ccacggtaag 7020 

agcccaacat gtaaaactgg ttgacggcct gatagggaca gcatcccttc tccacgggca 7080 

gcgagtaggc cagggcggcc ttgcgcaggg aggtgtgagt cagggcaaag gtgtcgcgga 7140 

ccataacttt tacaaactgg tacttaaagt cccggtcgtc gcacatgcct cgctcccagt 7200 
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ctgagtagtc tgtgcgcttt ttgtgcttgg ggttaggcag ggagtaggtg acgtcgttaa 7260 

agaggatttt gccacatctg ggcataaagt tgcgagagat tctgaagggg ccgggcacct 7320 

ccgagcggtt gttgatgact tgggcagcca ggagaatttc gtcgaagccg ttgatgttgt 7380 

gccccacgac gtagaactct atgaaacgcg gagcgccgcg cagcaggggg cacttttcaa 7440 

gttgctggaa agtaagttcc cgcggctcga cgccgtgttc cgtgcggctc cagtcctcca 7500 

ccgggtttcg ctccacaaaa tcctgccaga tgtggtcgac tagcaagagc tgcagtcggt 7560 

cgcgaaattc gcggaatttt ctgccgatgg cttgcttctg ggggttcaag caaaaaaagg 7620 

tgtctgcgtg gtcgcgccag gcgtcccagc cgagctcgcg agccagattc agggccagca 7680 

gcaccagagc cggctcaccg gtgattttca tgacgaggag aaagggcacc agctgttttc 7740 

cgaacgcgcc catccaggtg taggtctcca cgtcgtaggt gagaaacaga cgttcggtcc 7800 

gcgggtgcga tcccaggggg aaaaacttga tgggctgcca ccattgggag ctctgggcgt 7860 

ggatgtgatg gaagtaaaag tcccggcggc gcgtggaaca ttcgtgctgg tttttgtaaa 7920 

agcggccgca gtggtcgcag cgcgagacgg agtgaaggct gtgaatcagg tgaatcttgc 7980 

gtcgctgagg gggccccaga gccaaaaagc ggagcgggaa cgaccgcgcg gccacttcgg 8040 

cgtccgcagg caagatggat gagggttcca ccgttccccg cccgcggacc gaccagactt 8100 

ccgccagctg cggcttcagt tcttgcacca gctctcgcag cgtttcgtcg ctgggcgaat 8160 

cgtgaatacg gaagttgtcg ggtagaggcg ggaggcggtg gacttccagg aggtgtgtga 8220 

gggccggcag gagatgcagg tggtacttga tttcccacgg atgacggtcg cgggcgtcca 8280 

aggcgaagag atgaccgtgg ggccgcggcg ccaccagcgt tccgcggggg gtctttatcg 8340 

gcggcgggga cgggctcccg gcggcagcgg cggctcggga cccgcgggca agtcgggcag 8400 

cggcacgtcg gcgtggagct cgggcagggg ctggtgctgc gcgcggagct gactggcaaa 8460 

ggctatcacc cggcgattga cgtcctggat ccggcggcgc tgcgtgaaga ccaccggacc 8520 

cgtggtcttg aacctgaaag agagttcgac agaatcaatc tcggcatcgt taaccgcggc 8580 

ctggcgcagg atttcggcca cgtccccgga gttgtcttga tacgcgattt ctgccatgaa 8640 

ctggtcgatt tcctcttcct gcaagtctcc gtgaccggcg cgttcgacgg tggccgcgag 8700 

atcgttggag atgcggccca tgagctggga aaaggcattg atgccgacct cgttccacac 8760 

tcggctgtac accacctctc cgtgaacgtc gcgggcgcgc atcaccacct gggcgagatt 8820 

gagttccacg tggcgggcga aaaccggata gtttcggagg cgctgataca gatagttgag 8880 

ggtggtggcg gcgtgctcgg ccacaaaaaa atacatgatc cagcggcgga gggtcagctc 8940 
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gttgatgtcg 


cccagcgcct 


ccaggcgttc 


catggcctcg 


taaaagtcca 


cggcaaagtt 


9000 


gaaaaattgg 


ctgttcctgg 


ccgagaccgt 


gagctcttct 


tccaagagcc 


gaatgagatc 


9060 


cgccacggtg 


gccctgactt 


cgcgttcgaa 


agccccgggt 


gcctcctcca 


cctcttcctc 


9120 


ctcgacttct 


tcgaccgctt 


cgggcacctc 


ctcttcctcg 


accaccacct 


caggcggggc 


9180 


tcggcggcgc 


cggcggcgga 


cgggcaggcg 


gtcgacgaaa 


cgctcgatca 


tttcccccct 


9240 


ccgtcgacgc 


atggtctcgg 


tgacggcgcg 


accctgttcg 


cgaggacgca 


gggtgaaggc 


9300 


gccgccgccg 


agcggaggta 


acagggagat 


cggggggcgg 


tcgtggggga 


gactgacggc 


9360 


gctaactatg 


catctgatca 


atgtttgcgt 


agtgacctcg 


ggtcggagcg 


agctcagcgc 


9420 


ttgaaaatcc 


acgggatcgg 


aaaaccgttc 


caggaacgcg 


tctagccaat 


cacagtcgca 


9480 


aggtaagctg 


aggaccgtct 


cgggggcttg 


tctgttctgt 


cttcccgcgg 


tggtgctgct 


9540 


gatgaggtag 


ttgaagtagg 


cgctcttgag 


gcggcggatg 


gtggacagga 


gaaccacgtc 


9600 


tttgcgccca 


gcttgctgta 


tccgcaggcg 


gtcggccatg 


ccccacactt 


ctccttgaca 


9660 


gcggcggagg 


tccttgtagt 


attcttgcat 


cagcctttcc 


acgggcacct 


cgtcttcttc 


9720 


ttccgctcgg 


ccggacgaga 


gccgcgtcag 


gccgtacccg 


cgctgcccct 


gtggttggag 


9780 


cagggccagg 


tcggccacga 


cgcgctcggc 


cagcacggcc 


tgctggatgc 


gggtgagggt 


9840 


gtcctgaaag 


tcgtcgagat 


ccacaaagcg 


gtggtacgcg 


ccagtgttga 


tggtgtaggt 


9900 


gcagttgctc 


atgacggacc 


agtttacggt 


ctgggtgcca 


tggcccacgg 


tttccaggta 


9960 


gcggagacgc 


gagtaggccc 


gcgtctcgaa 


gatgtagtcg 


ttgcaggtcc 


gcagcaggta 


10020 


ctggtagccc 


accagcagat 


gcggcggcgg 


ctggcggtag 


aggggccacc 


gctgggtggc 


10080 


gggggcgttg 


ggggcgagat 


cttccaacat 


gaggcggtga 


tagccgtaga 


tgtagcgcga 


10140 


catccaagtg 


atgccgctgg 


ccgtggtgct 


ggcgcgggcg 


tagtcgcgaa 


cgcggttcca 


10200 


gatgtttcgc 


agcggctgga 


agtactcgat 


ggtggggcga 


ctctgccccg 


tgaggcgggc 


10260 


gcagtcggcg 


atgctctacg 


gggaaaaaga 


agggccagtg 


aacaaccgcc 


ttccgtagcc 


10320 


ggaggagaac 


gcaagggggt 


caaagaccac 


cgaggctcgg 


gttcgaaacc 


cgggtggcgg 


10380 


cccgaatacg 


gagggcggtt 


ttttgctttt 


ttctcagatg 


catcccgtgc 


tgcggcagat 


10440 


gcgtccgaac 


gcggggtccc 


agtccccggc 


ggtgcctgcg 


gccgtgacgg 


cggcttctac 


10500 


ggccacgtcg 


cgctccaccc 


cgcctaccac 


ggcccaggcg 


gcggtggctc 


tgcgcggcgc 


10560 


aggggaaccc 


gaagcagagg 


cggtgttgga 


cgtggaggag 


ggccaggggt 


tggctcggct 


10620 
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gggggccctg agtcccgagc ggcacccgcg cgtggctctg aagcgcgacg cggcggaggc 10680 

gtacgtgccg cggagcaatc tgtttcgcga ccgcagcggc gaggaggccg aggagatgcg 10740 

agacttgcgt tttcgggcgg ggagggagtt gcgtcacggg ctggaccggc agagggttct 10800 

gagagaggag gactttgagg cggacgagcg cacgggggtg agtcccgcgc gggctcacgt 10860 

ggcggccgcc aacctggtga gcgcgtacga gcagacggtc aaggaggaga tgaacttcca 10920 

gaagagcttc aatcatcacg tgcgcacgct gattgcgcgc gaagaggtgg ccatcggcct 10980 

catgcatctg tgggattttg tggaggcgta cgttcagaac cccagcagca agccgctgac 11040 

ggctcagctg ttcctcatcg tgcaacatag tcgagacaac gaaacgttca gggaggccat 11100 

gctgaacatt gcagagcctg aggggcgctg gctcttggat ctcattaaca tcttgcagag 11160 

tatcgtagtg caggagcgct cgctgagcct ggccgacaag gtggctgcca tcaactacag 11220 

catgctgtcg ctgggcaaat tttacgcccg caagatctac aagtctccgt tcgtccccat 11280 

agacaaggag gtgaagatag acagctttta catgcgcatg gcgctcaagg tgctgactct 11340 

aagcgacgac ctgggggtgt accgcaacga ccgcatacac aaggcggtga gcgccagccg 11400 

ccggcgcgag ctgagcgacc gcgagctttt gcacagcctg catcgggcgt tgactggtgc 11460 

cggcagcgcc gaggcggccg agtactttga cgccggagcg gacttgcgct ggcagccatc 11520 

ccgacgcgcg ctggaggcgg ctggcgtcgg ggagtacggg gtcgaggacg acgatgaagc 11580 

ggacgacgag ttgggcattg acttgtagcc gtttttcgtt agatatgtcg gcgaacgagc 11640 

cgtctgcggc cgccatggtg acggcggcgg gcgcgcccca ggacccggcc acgcgcgcgg 11700 

cgctgcagag tcagccttcc ggagtgacgc ccgcggacga ctggtccgag gccatgcgtc 11760 

gcatcctggc gctgacggcg cgcaaccccg aggcttttcg gcagcagccg caggcaaacc 11820 

ggtttgcggc cattttggaa gcggtggtgc cctccagacc caaccccacc cacgaaaagg 11880 

tgctggccat cgtcaacgcc ctggcggaga ccaaggccat ccgcccagac gaggccgggc 11940 

aggtttacaa cgcgctgcta gaaagggtgg gacgctacaa cagctccaac gtgcagacca 12000 

atctggaccg cttggtgacg gacgtgaagg aggccgtagc ccagcgagag cggtttttca 12060 

aggaagccaa tctgggctcg ctggtggccc tcaacgcctt cctgagcacg ctgccggcga 12120 

acgtgccccg cggtcaggag gactacgtga actttctgag cgccctccgc ctgatggtgg 12180 

ccgaggtgcc gcagagcgag gtgtaccagt ctggccccaa ctactacttc cagacctccc 12240 

ggcagggcct gcagacggta aacctgacgc aggcctttca gaacctgcag ggcctttggg 12300 

gggtgcgcgc tccgctgggc gaccgcagca cggtgtccag cctgctgacc cccaatgccc 12360 
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ggctgctctt 


gcttctcatt 


gctccgttca ccgacagcgg ttccatcagc 


cgcgactctt 


12420 


acctgggaca 


cctgctcacc 


ctgtaccggg aggecategg geaggegegg 


gtggacgagc 


12480 


agacgtacca 


ggaaatcacc 


agegtgagee gcgcgctggg gcaggaggac 


aegggcaget 


12540 


tggaggcgac 


tctgaacttc 


ctgctgacca accggcggca gcgcctacct 


ccccagtacg 


12600 


cgctgaacgc 


ggaggaggag 


cgcatcctgc gtttcgtgca gcagagcacc 


gcgctgtact 


12660 


tgatgcggga 


aggcgcctct 


cccagcgctt cgctggacat gacggcggcc 


aacatggagc 


12720 


catcgttcta 


cgccgccaac 


cgtcccttcg tcaaccggct aatggactat 


ttgcatcggg 


12780 


cggcggccct 


gaacccggaa 


tactttacta acgtcatcct gaacgaccgt 


tggctgccac 


12840 


ctcccggctt 


etacaegggg 


gagttcgacc tcccggaggc caacgaeggt 


ttcatgtggg 


12900 


acgacgtgga 


cagcgtgttc 


ctgcccggca agaaggaggc gggtgactct 


cagagccacc 


12960 


gcgcgagcct 


cgcagacctg 


ggggcgaccg ggcccgcgtc tccgctgcct 


cgcctgccga 


13020 


gcgccagcag 


cgccagcgtg 


gggcgggtga gccgtccgcg cctcagcggt 


gaggaggact 


13080 


ggtggaacga 


tccgctgctc 


cgtccggccc gcaacaaaaa cttccccaac 


aacgggatag 


13140 


aggatttggt 


agacaaaatg 


aaccgttgga agaegtatge ccaggagcat 


cgggagtggc 


13200 


aggcgaggca 


acccatgggc 


cctgttctgc cgccctctcg gcgcccgcgc 


agggacgaag 


13260 


acgccgacga 


ttcagecgat 


gaeagcageg tgttggatct gggegggage 


gggaacccct 


13320 


ttgcccacct 


gcaacctcgc 


ggcgtgggtc ggcggtggcg ctaggaaaaa 


aaattattaa 


13380 


aagcacttac 


cagagecatg 


gtaagaagag caacaaaggt gtgtcctgct 


ttcttcccgg 


13440 


tagcaaa atg cgt egg gcg gtg gca gtt ccc tec gcg gca atg gcg tta 


13489 



Met Arg Arg Ala Val Ala Val Pro Ser Ala Ala Met Ala Leu 
15 10 

ggc ccg ccc cct tct tac gaa age gtg atg gca gcg gcc acc ctg caa 13537 
Gly Pro Pro Pro Ser Tyr Glu Ser Val Met Ala Ala Ala Thr Leu Gin 
15 20 25 3d 

gcg ccg ttg gag aat cct tac gtg ccg ccg cga tac ctg gag cct acg 13585 
Ala Pro Leu Glu Asn Pro Tyr Val Pro Pro Arg Tyr Leu Glu Pro Thr 
35 40 45 

ggc ggg aga aac age att cgt tac teg gag ctg acg ccc ctg tac gac 13633 
Gly Gly Arg Asn Ser lie Arg Tyr Ser Glu Leu Thr Pro Leu Tyr Asp 
50 55 60 

acc acc cgc ctg tac ctg gtg gac aac aag tea gca gat ate gcc acc 13681 
Thr Thr Arg Leu Tyr Leu Val Asp Asn Lys Ser Ala Asp lie Ala Thr 
65 70 75 
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ttg aac tac cag aac gac cac age aac ttt etc acg tec gtg gtg cag 13729 
Leu Asn Tyr Gin Asn Asp His Ser Asn Phe Leu Thr Ser Val Val Gin 
80 85 90 

aac age gac tac acg ccc gec gaa gcg age acg cag ace att aac ttg 13777 
Asn Ser Asp Tyr Thr Pro Ala Glu Ala Ser Thr Gin Thr lie Asn Leu 
95 100 105 110 

gac gac cgc teg cgc tgg ggc ggg gac ttg aaa acc att ctg cac act 13825 
Asp Asp Arg Ser Arg Trp Gly Gly Asp Leu Lys Thr lie Leu His Thr 
115 120 125 

aac atg ccc aac gtg aac gag ttc atg ttt acc aac teg ttc agg get 13873 
Asn Met Pro Asn Val Asn Glu Phe Met Phe Thr Asn Ser Phe Arg Ala 
130 135 140 

aaa ctt atg gtg gcg cac gag gee gac aag gac ccg gtt tat gag tgg 13921 
Lys Leu Met Val Ala His Glu Ala Asp Lys Asp Pro Val Tyr Glu Trp 
145 150 155 

gtg cag ctg acg ctg ccg gag ggg aac ttt tea gag att atg acc ata 13969 
Val Gin Leu Thr Leu Pro Glu Gly Asn Phe Ser Glu lie Met Thr lie 
160 165 170 

gac ctg atg aac aac gec att ate gac cac tac ctg gcg gta gee aga 14017 
Asp Leu Met Asn Asn Ala lie lie Asp His Tyr Leu Ala Val Ala Arg 
175 180 185 190 

cag cag ggg gtg aaa gaa age gag ate ggc gtc aag ttt gac acg cgc 14065 
Gin Gin Gly Val Lys Glu Ser Glu lie Gly Val Lys Phe Asp Thr Arg 
195 200 205 

aac ttt cgt ctg ggc tgg gac ccg gag acg ggg ctt gtg atg ccg ggg 14113 
Asn Phe Arg Leu Gly Trp Asp Pro Glu Thr Gly Leu Val Met Pro Gly 
210 215 220 

gtg tac acg aac gaa get ttc cat ccc gac gtg gtc etc ttg ccg ggc 14161 
Val Tyr Thr Asn Glu Ala Phe His Pro Asp Val Val Leu Leu Pro Gly 
225 230 235 

tgc ggg gtg gac ttt acc tac age egg tta aac aac ctg eta ggc ata 14209 
Cys Gly Val Asp Phe Thr Tyr Ser Arg Leu Asn Asn Leu Leu Gly lie 
240 * 245 250 

cgc aag aga atg ccc ttt cag gaa ggg ttt cag ate ctg tac gag gac 14257 
Arg Lys Arg Met Pro Phe Gin Glu Gly Phe Gin lie Leu Tyr Glu Asp 
255 260 265 270 

ctg gag ggc ggt aac ate ccg gec ctg ctg gac gtg ccg gcg tac gag 14305 
Leu Glu Gly Gly Asn lie Pro Ala Leu Leu Asp Val Pro Ala Tyr Glu 
275 280 285 

gag age ate gec aac gca agg gag gcg gcg ate agg ggc gat aat ttc 14353 
Glu Ser He Ala Asn Ala Arg Glu Ala Ala He Arg Gly Asp Asn Phe 
290 295 300 
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gcg gcg cag ccc cag gcg get cca acc ata aaa ccc gtt ttg gaa gac 14401 
Ala Ala Gin Pro Gin Ala Ala Pro Thr He Lys Pro Val Leu Glu Asp 
305 310 315 

tec aaa ggg egg age tac aac gta ata gee aac acc aac aac acg get 14449 
Ser Lys Gly Arg Ser Tyr Asn Val He Ala Asn Thr Asn Asn Thr Ala 
320 U 325 330 

tac agg age tgg tat ctg get tat aac tac ggc gac ccg gag aag ggg 14497 
Tyr Arg Ser Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly 
335 340 345 350 

gtt agg gee tgg acc ctg etc acc act ccg gac gtg acg tgc ggt tea 14545 
Val Arg Ala Trp Thr Leu Leu Thr Thr Pro Asp Val Thr Cys Gly Ser 
355 360 365 

gag cag gtc tac tgg teg ctg cct gac atg tac gtg gac cct gtg acg 14593 
Glu Gin Val Tyr Trp Ser Leu Pro Asp Met Tyr Val Asp Pro Val Thr 
370 375 380 

ttt cgc tec acg cag caa gtt age aac tac cca gtg gtg gga gcg gag 14641 
Phe Arg Ser Thr Gin Gin Val Ser Asn Tyr Pro Val Val Gly Ala Glu 
385 390 395 

ctt atg ccg att cac age aag age ttt tac aac gag cag gee gtc tac 14689 
Leu Met Pro He His Ser Lys Ser Phe Tyr Asn Glu Gin Ala Val Tyr 
400 405 410 

tea cag etc att cgt cag acc acc gec eta acg cac gtt ttc aac cgc 14737 
Ser Gin Leu He Arg Gin Thr Thr Ala Leu Thr His Val Phe Asn Arg 
415 420 425 430 

ttc ccc gag aac caa ate eta gtg cga cct cca gcg ccc acc ate acc 14785 
Phe Pro Glu Asn Gin He Leu Val Arg Pro Pro Ala Pro Thr He Thr 
435 440 445 

acc gtc age gag aac gtg ccc get eta acc gat cac ggg acg ctg cct 14833 
Thr Val Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro 
450 455 460 

ttg cag aac age ate cgc gga gtt cag cga gtt acc ate acg gac gee 14881 
Leu Gin Asn Ser He Arg Gly Val Gin Arg Val Thr He Thr Asp Ala 
465 470 475 

cgt cgt egg acc tgt ccc tac gtc tac aaa gee ttg gga ate gtg gee 14929 
Arg Arg Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly He Val Ala 
480 485 490 

ccg cgc gtc ctg teg agt cgc act ttc tag atgtccatcc tcatctctcc 14979 
Pro Arg Val Leu Ser Ser Arg Thr Phe 
495 500 

cagcaacaat accggttggg gtctgggcgt gaccaaaatg tacggaggcg ccaaacgacg 15039 

gtccccacaa catcccgtgc gagtgegegg gcactttaga gccccatggg ggtcgcacac 15099 

gcgcgggcgc accggccgaa ccaccgtcga egaegtgate gatagcgtgg tggccgacgc 15159 
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ccgcaactac 


cagcccgctc 


gatccacggt 


ggacgaagtc 


atcgacggcg 


tggtggccga 


15219 


cgccagggcc 


tacgcccgca 


gaaagtctcg 


tctgcgccgc 


cgccgttcgc 


taaagcgccc 


15279 


cacggccgcc 


atgaaagccg 


ctcgctctct 


gctgcgtcgc 


gcacgtatcg 


tgggtcgccg 


15339 


cgccgccaga 


cgcgcagccg 


ccaacgccgc 


cgccggccga 


gtgcgccgcc 


gggccgccca 


15399 


gcaggccgcc 


gccgccatct 


ccagtctatc 


cgccccccga 


cgcgggaatg 


tgtactgggt 


15459 


cagggactcg 


gccaccggcg 


tgcgagttcc 


cgtgagaacc 


cgtcctcctc 


gtccctgaat 


15519 


aaaaagttct 


aagcccaatc 


ggtgttccgt 


tgtgtgttca 


gctcgtcatg 


accaaacgca 


15579 


agtttaaaga 


ggagctgctg 


caagcgctgg 


tccccgaaat 


ctatgcgccg 


gcgccggacg 


15639 


tgaaaccgcg 


tcgcgtgaaa 


cgcgtgaaga 


agcaggaaaa 


gctagagaca 


aaagaggagg 


15699 


cggtggcgtt 


gggagacggg 


gaggtggagt 


ttgtgcgctc 


gttcgcgccg 


cgtcggcgag 


15759 


tgaattggaa 


ggggcgcaag 


gtgcaacggg 


tgctgcgtcc 


cggcacggtg 


gtgtctttca 


15819 


ccccgggtga 


aaaatccgcc 


tggaagggca 


taaagcgcgt 


gtacgatgag 


gtgtacgggg 


15879 


acgaagacat 


tctggagcag 


gcgctggata 


gaagcgggga 


gtttgcttac 


ggcaagaggg 


15939 


cgaggacggg 


cgagatcgcc 


atcccgctgg 


acacttccaa 


ccccaccccc 


agtctgaaac 


15999 


ccgtgacgct 


gcaacaggtg 


ttgccggtga 


gcgccccctc 


gcgacgcggc 


ataaaacgcg 


16059 


agggcggcga 


gctgcagccc 


accatgcagc 


tcctggttcc 


caagaggcag 


aaactagagg 


16119 


acgtactgga 


catgataaaa 


atggagcccg 


acgtgcagcc 


cgatattaaa 


atccgtccca 


16179 


tcaaagaagt 


ggcgccggga 


atgggcgtgc 


agaccgtgga 


catccagatt 


cccatgacca 


16239 


gcgccgcaca 


ggcggtagag 


gccatgcaga 


ccgacgtggg 


gatgatgacg 


gacctgcccg 


16299 


cagctgctgc 


cgccgtggcc 


agcgccgcga 


cgcaaacgga 


agccggcatg 


cagaccgacc 


16359 


cgtggacgga 


ggcgcccgtg 


cagccggcca 


gaagacgcgt 


cagacggacg 


tacggccccg 


16419 


tttctggcat 


aatgccggag 


tacgcgctgc 


atccttccat 


catccccacc 


cccggctacc 


16479 


gggggcgcac 


ctaccgtccg 


cgacgcagca 


ccactcgccg 


ccgtcgccgc 


acggcacgag 


16539 


tcgccaccgc 


cagagtgaga 


cgcgtaacga 


cacgtcgcgg 


ccgccgcttg 


accctgcccg 


16599 


tggtgcgcta 


ccatcccagc 


attctttaaa 


aaaccgctcc 


tacgttgcag 


atgggcaagc 


16659 


ttacttgtcg 


actccgtatg 


gccgtgcccg 


gctaccgagg 


aagatcccgc 


cgacgacgga 


16719 


ctttgggagg 


cagcggtttg 


cgccgccgtc 


gggcggttca 


ccggcgcctc 


aagggaggca 


16779 


ttctgccggc 


cctgatcccc 


ataatcgccg 


cagccatcgg 


ggccattccc 


ggaatcgcca 


16839 
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gcatagcggt 

3 3 3 w 3 3 w 


acaggctagc 

3 3 3 w ^ 3 ^ 


cagcgccact 


gattttacta 


accctgtcgg 


tcgcgccgtc 


16899 


tctttcggca 


gactcaacgc 


ccagcatgga 


agacatcaat 


ttctcctctc 


tggccccgcg 


16959 


gcacggcacg 

3 3 3 j 


cggccgtata 


tggggacgtg 

33333 3 


gagcgagatc 


ggcacgaacc 


agatgaacgg 


17019 


gggcgctttc 


aattggagcg 

33 3 3 


gtgtgtggag 

3 3 3 3 3 3 


cggcttgaaa 


aatttcggtt 


ccactctgaa 


17079 


aacttacggc 


aaccgggtgt 

^ 3 3 3 "3 


ggaactccag 


cacggggcag 


atgctgaggg 


acaagctaaa 


17139 


ggacacgcag 


tttcagcaaa 


aggtggtgga 


cggcatcgct 


tcgggcctca 


acggcgccgt 


17199 


cgacctggcc 


aaccaggcca 


ttcaaaagga 


aattaacagc 


cgcctggagc 


cgcggccgca 


17259 


ggtggaggag 


aacctgcccc 


ctctggaggc 

33 33 


gctgcccccc 


aagggagaga 


agcgcccgcg 


17319 


gcccgacatg 


aaggagacgc 

3"-33 , "*3'"* w 3 w 


tagttactaa 


gagcgaggag 

333333 


ccgccatcat 


acgaggaggc 


17379 


acrtaggtaac 

33 w 333 fc -**3 ^ - 


tcgcagctgc 


cgtccctcac 


gctgaagccc 


accacctatc 


ccatgaccaa 


17439 


gcccatcgcc 


tccatggcgc 


gccccgtggg 

3 w w 3 w 3 3 3 


agtcgacccg 


cccatcgacg 


cggtggccac 


17499 


tttggacctg 


ccgcgccccg 


aacccggcaa 


ccgcgtgcct 


cccgtcccca 


tcgctccgcc 


17559 


ggtttctcgc 


cccgccatcc 


gccccgtcgc 


cgtggccact 


ccccgctatc 


cgagccgcaa 


17619 


cgccaactgg 


cagaccaccc 


tcaacagtat 


tgtcggactg 


ggggtgaagt 


ctctgaagcg 


1 *7 CI Q 

i /o /y 


ccgtcgctgt 


ttttaaagca 


caatttatta 


aacgagtagc 


cctgtcttaa 


tccatcgttg 


17739 


tatgtgtgcc 


tatatcacgc 


gttcagagcc 


tgaccgtccg 


tcaag atg gcc act ccg 


17796 



Met Ala Thr Pro 
505 



teg atg atg ccg cag tgg teg tac atg cac ate gcc ggg cag gac gcc 
Ser Met Met Pro Gin Trp Ser Tyr Met His lie Ala Gly Gin Asp Ala 
510 515 520 



17844 



teg gag tac ctg age ccg ggt ctg gtg cag ttt gcc cgt gcg acg gaa 
Ser Glu Tyr Leu Ser Pro Gly Leu Val Gin Phe Ala Arg Ala Thr Glu 
525 " 530 535 



17892 



acc tac ttc tea ctg ggc aac aag ttc agg aac ccc acc gtg gcg ccc 
Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro Thr Val Ala Pro 
540 ^ 545 550 555 



17940 



acc cac gac gtc acc acc gat egg tec cag cga ctg aca ate cgc ttc 
Thr His Asp Val Thr Thr Asp Arg Ser Gin Arg Leu Thr lie Arg Phe 
560 565 570 



17988 



gtc ccc gtg gac aag gaa gac acc get tac tec tac aaa acc cgc ttc 
Val Pro Val Asp Lys Glu Asp Thr Ala Tyr Ser Tyr Lys Thr Arg Phe 
575 580 585 



18036 



acg ctg gcc gtg ggc gac aac egg gtg eta gac atg gcc agt acc tac 
Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met Ala Ser Thr Tyr 
590 595 600 



18084 
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ttt gac ate cgc ggc gtg ate gac cgc gga cct age ttc aag cct tac 18132 
Phe Asp He Arg Gly Val He Asp Arg Gly Pro Ser Phe Lys Pro Tyr 
605 610 615 

tec ggc acg get tac aac tea ctg get ccc aaa ggg gcg ccc aac aac 18180 
Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys* Gly Ala Pro Asn Asn 
620 625 630 635 

age caa tgg aac gec aca gat aac ggg aac aag cca gtg tgt ttt get 
Ser Gin Trp Asn Ala Thr Asp Asn Gly Asn Lys Pro Val Cys Phe Ala 
640 645 650 

cag gca get ttt ata ggt caa age att aca aaa gac gga gtg caa ata 
Gin Ala Ala Phe He Gly Gin Ser He Thr Lys Asp Gly Val Gin He 
655 660 665 

cag aac tea gaa aat caa cag get get gee gac aaa act tac caa cca 
Gin Asn Ser Glu Asn Gin Gin Ala Ala Ala Asp Lys Thr Tyr Gin Pro 
670 675 680 

gag cct caa att gga gtt tec acc tgg gat acc aac gtt ace agt aac 18372 
Glu Pro Gin He Gly Val Ser Thr Trp Asp Thr Asn Val Thr Ser Asn 
685 690 695 



get gee gga cga gtg tta aaa gee acc act ccc atg ctg cca tgt tac 
Ala Ala Gly Arg Val Leu Lys Ala Thr Thr Pro Met Leu Pro Cys Tyr 
700 " " 705 710 715 

ggt tea tat gee aat ccc act aat cca aac ggg ggt cag gca aaa aca 
Gly Ser Tyr Ala Asn Pro Thr Asn Pro Asn Gly Gly Gin Ala Lys Thr 
720 725 730 



aat aat ccc aaa gtg gtt ctt tac age gaa gat gta aac ctt caa gee 
Asn Asn Pro Lys Val Val Leu Tyr Ser Glu Asp Val Asn Leu Gin Ala 
750 755 760 

ccc gat act cac tta gta tat aag cca acg gtg gga gaa aac gtt ate 
Pro Asp Thr His Leu Val Tyr Lys Pro Thr Val Gly Glu Asn Val He 
765 770 775 

gec gca gaa gee ctg eta acg cag cag gcg tgt ccc aac aga gca aac 
Ala Ala Glu Ala Leu Leu Thr Gin Gin Ala Cys Pro Asn Arg Ala Asn 
780 785 790 795 

tac ata ggt ttc cga gat aac ttt ate ggt tta atg tat tat aac age 
Tyr He Gly Phe Arg Asp Asn Phe He Gly Leu Met Tyr Tyr Asn Ser 
800 * 805 810 



18228 



18276 



18324 



18420 



18468 



gaa gga gac att teg eta aac ttt ttc aca aca act gcg gca gca gac 18516 
Glu Gly Asp He Ser Leu Asn Phe Phe Thr Thr Thr Ala Ala Ala Asp 
735 740 745 



18564 



18612 



18660 



18708 



aca ggg aac atg gga gtt ctg gca ggt cag gee teg cag tta aac gca 18756 
Thr Gly Asn Met Gly Val Leu Ala Gly Gin Ala Ser Gin Leu Asn Ala 
815 ~ 820 825 
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gtt gta gac ctg caa gat cga aac acg gaa ctg tec tat cag eta atg 18804 
Val Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin Leu Met 
830 835 840 

eta gat get ctg ggt gac aga act cga tat ttc tea atg tgg aat cag 18852 
Leu Asp Ala Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gin 
845 850 855 

gee gtg gac age tac gat cca gac gtt agg att ate gag aac cat ggg 18900 
Ala Val Asp Ser Tyr Asp Pro Asp Val Arg lie He Glu Asn His Gly 
860 865 870 875 



gtg gaa gac gag ctg ccc aat tac tgt ttt cca etc cca ggc atg ggt 
Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Pro Gly Met Gly 
880 885 890 



18948 



att ttt aac tec tac aag ggg gta aaa cca caa aat ggc ggt aat ggt 
He Phe Asn Ser Tyr Lys Gly Val Lys Pro Gin Asn Gly Gly Asn Gly 
895 900 905 



18996 



aac tgg gaa gca aac ggg gac eta tea aat gee aat gag ate get tta 
Asn Trp Glu Ala Asn Gly Asp Leu Ser Asn Ala Asn Glu He Ala Leu 
910 ~ ~ 915 920 



19044 



gga aac att ttt gee atg gaa att aac etc cac gca aac ctg tgg cgc 
Gly Asn He Phe Ala Met Glu He Asn Leu His Ala Asn Leu Trp Arg 
925 930 935 



19092 



age ttc ttg tac age aat gtg gcg ctg tac ctg cca gac age tat aaa 
Ser Phe Leu Tyr Ser Asn Val Ala Leu Tyr Leu Pro Asp Ser Tyr Lys 
940 945 950 955 



19140 



ttc act ccc get aac ate act ctg ccc gec aac caa aac ace tac gag 
Phe Thr Pro Ala Asn He Thr Leu Pro Ala Asn Gin Asn Thr Tyr Glu 
960 965 970 



19188 



tat ate aac ggg cgc gtc act tct cca ace ctg gtg gac ace ttt gtt 
Tyr He Asn Gly Arg Val Thr Ser Pro Thr Leu Val Asp Thr Phe Val 
975 980 985 



19236 



aac att gga gee cga tgg teg ccg gat ccc atg gac aac gtc aac ccc 
Asn He Gly Ala Arg Trp Ser Pro Asp Pro Met Asp Asn Val Asn Pro 
990 995 1000 



19284 



ttt aac cat cac egg aac gcg ggc etc cgt tac cgc 
Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg 
1005 1010 1015 



tec atg ctg 
Ser Met Leu 



19329 



ctg gga aat gga cgc gtg gtg cct ttc cac ata caa gtg ccg caa 19374 
Leu Gly Asn Gly Arg Val Val Pro Phe His He Gin Val Pro Gin 
1020 1025 1030 

aaa ttt ttc gcg att aag aac etc ctg ctt ttg ccc ggc tec tac 19419 
Lys Phe Phe Ala He Lys Asn Leu Leu Leu Leu Pro Gly Ser Tyr 
1035 1040 1045 
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act tac gag tgg age ttc aga aaa gac gtg aac atg att ctg cag 
Thr Tyr Glu Trp Ser Phe Arg Lys Asp Val Asn Met lie Leu Gin 
1050 1055 1060 



19464 



age acc ctg ggc aat gat ctt cga gtg gac ggg gec age gtc cgc 
Ser Thr Leu Gly Asn Asp Leu Arg Val Asp Gly Ala Ser Val Arg 
1065 1070 1075 



19509 



att gac age gtc aac ttg tac gee aac ttt ttc ccc atg gcg cac 
He Asp Ser Val Asn Leu Tyr Ala Asn Phe Phe Pro Met Ala His 
1080 1085 1090 



19554 



aac acc get tct acc ttg gaa gee atg ctg cga aac gac acc aac 
Asn Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn 
1095 1100 1105 



19599 



gac cag teg ttt aac gac tac 
Asp Gin Ser Phe Asn Asp Tyr 
1110 1115 



etc age gcg gee aac atg ctt tat 
Leu Ser Ala Ala Asn Met Leu Tyr 
1120 



19644 



ccc att ccg gec aac gee acc aac gtt ccc att tec 
Pro He Pro Ala Asn Ala Thr Asn Val Pro He Ser 
1125 1130 1135 



att ccc tec 
He Pro Ser 



19689 



cgc aac tgg gcg gee ttc egg 
Arg Asn Trp /Via Ala Phe Arg 
1140 1145 



gga tgg age ttc acc cgc ctt aaa 
Gly Trp Ser Phe Thr Arg Leu Lys 
1150 



19734 



gee aag gaa acg cct tec ttg ggc tec ggc ttt gac ccc tac ttt 
Ala Lys Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr Phe 
1155 1160 1165 



19779 



gtg tac tea ggc acc att cct tac ctg gac ggc age ttt tac etc 
Val Tyr Ser Gly Thr He Pro Tyr Leu Asp Gly Ser Phe Tyr Leu 
1170 1175 1180 



19824 



aac cac act ttc aaa cgt ctg tec ate atg ttc gat tct tec gta 
Asn His Thr Phe Lys Arg Leu Ser He Met Phe Asp Ser Ser Val 
1185 1190 1195 



19869 



agt tgg ccg ggc aac gac cgc etc ctg acg ccg aac gag ttc gaa 
Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu 
1200 ~ 1205 1210 



19914 



att aag cgc att gtg gac ggg gaa ggc tac aac gtg get caa agt 
He Lys Arg He Val Asp Gly Glu Gly Tyr Asn Val Ala Gin Ser 
1215 1220 1225 



19959 



aac atg acc aaa gac tgg ttt 
Asn Met Thr Lys Asp Trp Phe 
1230 ~ * 1235 



tta att caa atg etc age cac tac 
Leu He Gin Met Leu Ser His Tyr 
1240 



20004 



aac ate ggc tac caa ggc ttc tat gtt ccc gag ggc 
Asn He Gly Tyr Gin Gly Phe Tyr Val Pro Glu Gly 
1245 ^ 1250 1255 



tac aag gat 
Tyr Lys Asp 



20049 
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egg atg tat tct ttc ttc cga aac ttt cag ccc atg age cgc cag 
Arg Met Tyr Ser Phe Phe Arg Asn Phe Gin Pro Met Ser Arg Gin 
1260 1265 1270 



20094 



gtg ccg gat ccc acc get gec ggc tat caa gec gtt ccc ctg ccc 
Val Pro Asp Pro Thr Ala Ala Gly Tyr Gin Ala Val Pro Leu Pro 
1275 1280 1285 



20139 



aga caa cac aac aac teg ggc ttt gtg ggg tac atg ggc ccg acc 
Arg Gin His Asn Asn Ser Gly Phe Val Gly Tyr Met Gly Pro Thr 
1290 1295 1300 



20184 



atg cgc gaa gga cag cca tac ccg gec aac tac ccc tat ccc ctg 
Met Arg Glu Gly Gin Pro Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu 
1305 1310 1315 



20229 



ate ggc get acc gee gtc ccc gee att acc cag aaa aag ttt ttg 
lie Gly Ala Thr Ala Val Pro Ala He Thr Gin Lys Lys Phe Leu 
1320 1325 1330 



20274 



tgc gac cgc gtc atg tgg cgc ata cct ttt tec age aac ttt atg 
Cys Asp Arg Val Met Trp Arg He Pro Phe Ser Ser Asn Phe Met 
1335 1340 1345 



20319 



tea atg ggg gec ctg acc gac etc gga cag aac atg ctt tac get 
Ser Met Gly Ala Leu Thr Asp Leu Gly Gin Asn Met Leu Tyr Ala 
1350 1355 1360 



20364 



aac tec gee cat gee ctg gat atg act ttt gag gtg gac ccc atg 
Asn Ser Ala His Ala Leu Asp Met Thr Phe Glu Val Asp Pro Met 
1365 1370 1375 



20409 



aac gag ccc acg ttg ctg tac atg ctt ttt gag gtg ttc gac gtg 
Asn Glu Pro Thr Leu Leu Tyr Met Leu Phe . Glu Val Phe Asp Val 
1380 1385 1390 



20454 



gtc aga gtg cac cag ccg cac cgc ggt att ate gag gec gtg tac 
Val Arg Val His Gin Pro His Arg Gly He He Glu Ala Val Tyr 
1395 1400 1405 



20499 



ctg cgc acc ccc ttc tct gcg ggc aat gec acc aca taa gccgctgaac 20548 
Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr 
1410 1415 1420 



tagctggttt 


ttaccccaga 


tcccatgggc 


tecaeggaag 


aegaactgeg 


ggccattgtg 


20608 


cgagacctgg 


gctgcggacc 


ctacttcctg 


ggcacctttg 


acaageggtt 


tcccgggttc 


20668 


gtgtctcctc 


gcaaactcgc 


gtgegegate 


gtgaataccg 


ccggccgaga 


gaeeggagga 


20728 


gagcattggc 


tagctctggg 


ctggaacccc 


cgctcgtcca 


cgtttttcct 


gttcgacccc 


20788 


tttggctttt 


cagaccaacg 


cttgaagcag 


atetatgeat 


ttgaatatga 


gggtctactc 


20848 


aagcgaagcg 


cgctggcctc 


ctccgccgat 


cactgtctaa 


ccctggtaaa 


gagcactcag 


20908 


aeggttcagg 


gccctcacag 


cgccgcctgt 


ggcctttttt 


gttgcatgtt 


tttgcacgcc 


20968 
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tttgtgaact 


ggccggacac 


ccccatggaa 


aacaacccca 


ccatggacct 


cctgactggc 


21028 


gttcccaact 


ccatgctcca 


aagccccagc 


gtgcagacca 


ccctcctcca 


aaaccagaaa 


21088 


aatctgtacg 


cctttctgca 


caagcactct 


ccctactttc gccgccatcg 


ggaacaaata 


21148 


gaaaatgcaa 


ccgcgtttaa 


caaaactctg 


taacgtttaa 


taaatgaact 


ttttattgaa 


21208 


ctggaaaacg 


ggtttgtgat 


ttttaaaaat 


caaaggggtt 


gagctggaca 


tccatgtggg 


21268 


aggccggaag 


ggtggtgttc 


ttgtactggt 


acttgggcag 


ccacttaaac 


tctggaatca 


21328 


caaacttggg 


cagcggtatt 


tctgggaagt 


tgtcgtgcca 


cagctggcgg 


gtcagctgaa 


21388 


gtgcctgcag 


aacatcgggg 


gcggagatct 


tgaagtcgca 


gtttatctgg 


ttcacggcac 


21448 


gcgcgttgcg 


gtacatggga 


ttggcacact 


gaaacaccag 


caggctggga 


ttcttgatgc 


21508 


tagccagggc 


cacggcgtcg 


gtcacgtcac 


cggtgtcttc tatgttggac 


agcgaaaaag 


21568 


gcgtgacttt 


gcaaagctgg 


cgtcccgcgc 


gaggcacgca 


atctcccagg 


tagttgcact 


21628 


cacagcggat 


gggcagaaga 


agatgcttgt 


ggccgcgggt 


catgtaggga 


taggccgctg 


21688 


ccataaaagc 


ttcgatctgc 


ctgaaagcct 


gcttggcctt 


gtgcccttcg 


gtataaaaaa 


21748 


caccgcagga 


cttgttggaa 


aaggtattac 


tggcgcaagc ggcatcgtga 


aagcaagcgc 


21808 


gtgcgtcttc 


gtttcgtaac 


tgcaccacgc 


tgcggcccca 


ccggttctga 


atcaccttgg 


21868 


ccctgccggg 


gttttccttg 


agagcgcgct 


ggccggcttc gctgcccaca 


tccatttcca 


21928 


cgacatgctc 


cttgttaatc 


atggccagac 


cgtggaggca 


gcgcagctcc 


tcgtcatcgt 


21988 


cggtgcagtg 


atgctcccac 


acgacgcagc 


cagtgggctc 


ccacttgggc 


ttggaggcct 


22048 


cggcaatgcc 


agaatacagg 


agaacgtagt 


ggtgcagaaa 


acgtcccatc 


atggtgccaa 


22108 


aggttttctg 


gctgctgaag 


gtcatcgggc 


agtacctcca gtcctcgtta 


agccaagtgt 


22168 


tgcagatctt 


cctgaagacc 


gtgtactgat 


cgggcataaa 


>gtggaactca 


ttgcgctcgg 


22228 


tcttgtcgat 


cttatacttt 


tccatcagac 


tatgcataat 


ctccatgccc 


ttttcccagg 


22288 


cgcaaacaat 


cttggtgcta 


cacgggttag 


gtatggccaa 


agtggttggc 


ctctgaggcg 


22348 


gcgcttgttc 


ttcctcttga 


gccctctccc 


gactgacggg 


ggttgaaaga 


gggtgcccct 


22408 


tggggaacgg 


cttgaacacg 


gtctggcccg 


aggcgtcccg 


aagaatctgc 


atcgggggat 


22468 


tgctggccgt 


catggcgatg 


atctgacccc 


ggggctcctc 


cacttcgtcc 


tcctcgggac 


22528 


tttcctcgtg 


cttttcgggg 


gacggtacgg 


gagtaggggg aagagcgcgg 


cgcgccttct 


22588 


tcttgggcgg 


cagttccgga 


gcctgctctt 


gacgactggc 


cattgtcttc 


tcctaggcaa 


22648 
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gaaaaacaag atggaagact ctttctcctc ctcctcgtca acgtcagaaa gcgagtcttc 22708 

caccttaagc gccgagaact cccagcgcat agaatccgat gtgggctacg agactccccc 22768 

cgcgaacttt tcgccgcccc ccataaacac taacgggtgg acggactacc tggccctagg 22828 

agacgtactg ctgaagcaca tcaggcggca gagcgttatc gtgcaagatg ctctcaccga 22888 

gcgactcgcg gttccgctgg aagtggcgga acttagcgcc gcctacgagc gaaccctctt 22948 

ctccccaaag actcccccca agaggcaggc taacggcacc tgcgagccta accctcgact 23008 

caacttctac cctgcctttg ccgtgccaga ggtactggct acgtaccaca tttttttcca 23068 

aaaccacaaa atccctctct cgtgccgcgc caaccgcacc aaagccgatc gcgtgctgcg 23128 

actggaggaa ggggctcgca tacctgagat tgcgtgtctg gaggaagtcc caaaaatctt 23188 

tgaaggtctg ggccgcgacg aaaagcgagc agcaaacgct ctggaagaga acgcagagag 23248 

tcacaacagc gccttggtag aactcgaggg cgacaacgcc agactggccg tcctcaaacg 23308 

gtccatagaa gtcacgcact tcgcctaccc cgccgttaac ctccctccaa aagttatgac 23368 

agcggtcatg gactcgctgc tcataaagcg cgctcagccc ttagacccag agcacgaaaa 23428 

caacagtgac gaaggaaaac cggtggtttc tgatgaggag ttgagcaagt ggctgtcctc 23488 

caacgacccc gccacgttgg aggaacgaag aaaaaccatg atggccgtgg tgctagttac 23548 

cgtgcaatta gaatgtctgc agaggttctt ttcccaccca gagaccctga gaaaagtgga 23608 

ggaaacgctg cactacacat ttaggcacgg ctacgtgaag caagcctgca agatttccaa 23668 

cgtagaactt agcaacctca tctcctacct ggggatcttg cacgaaaacc gcctcggaca 23728 

aaacgtgctg cacagcacac tgaaaggaga agcccgccga gactatgtgc gagactgcgt 23788 

gttcctagcg ctagtgtaca cctggcagag cggaatggga gtctggcagc agtgcctgga 23848 

ggacgaaaac ctcaaagagc ttgaaaagct gctggtgcgc tccagaaggg cactgtggac 23908 
cagttttgac gagcgcaccg ccgcgcgaga cctagctgat attatttttc ctcccaagct 23968 
ggtgcagact ctccgggaag gactgccaga ttttatgagt caaagcatct tgcaaaactt 24028 
ccgctctttc atcttggaac gctcgggaat cttgcccgcc actagctgcg ccctacccac 24088 
agattttgtg cbtctccact accgcgaatg cccaccgccg ctgtggccgt acacttactt 24148 
gcttaaactg gccaactttc taatgttcca ctctgacctg gcagaagacg ttagcggcga 24208 
ggggctgcta gaatgccact gccgctgcaa cctgtgcacc ccccaccgct ctctagtatg 24268 
caacactccc ctgctcaatg agacccagat catcggtacc tttgaaatcc agggaccctc 24328 
cgacgcggaa aacggcaagc aggggtctgg gctaaaactc acagccggac tgtggacctc 24388 
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cgcctacttg cgcaaatttg taccagaaga ctatcacgcc caccaaatta aattttacga 24448 

aaaccaatca aaaccaccca aaagcgagtt aacggcttgc gtcattacgc agagcagcat 24508 

agttgggcag ttgcaagcca ttaacaaagc gcggcaagag tttctcctaa aaaaaggaaa 24568 

aggggtctac ttggaccccc agaccggcga ggaactcaac ggaccctcct cagtcgcagg 24628 

ttgtgtgccc catgccgccc aaaaagaaca cctcgcagtg gaacatgcca gagacggagg 24688 

aagaggagtg gagcagtgtg agcaacagcg aaacggagga agagccgtgg cccgaggggt 24748 

gcaacgggga agaggacacg gagggacggc gaagtcttcg ccgaagaact ctcgccgctg 24808 

cccccgaagt cccagccggc cgcctcggcc caagatcccg cacacacccg tagatgggat 248 68 

agcaagacca aaaagccggg taagagaaac gctcgccccc gccagggcta ccgctcgtgg 24928 

agaaagcaca aaaactgcat cttatcgtgc ttgctccagt gcggcggaga cgtttcgttc 24988 

acccgtagat acttgctttt taacaaaggg gtggccgtcc cccgtaacgt cctccactac 25048 

taccgtcact cttacagctc cgaagcggac ggctaagaaa acgcagcagt tgccggcggg 25108 

aggactgcgt ctcagcgccc gagaaccccc agccaccagg gagctccgaa accgcatatt 25168 

tcccaccctc tacgctatct ttcagcaaag ccgggggcag cagcaagaac tgaaaataaa 25228 

aaaccgcacg ctgaggtcgc ttacccgaag ctgcctctat cacaagagcg aagagcagct 25288 

gcagcgaacc ctggaggacg cagaagcgct gttccagaag tactgcgcga ccaccctaaa 25348 

taactaaaaa agcccgcgcg cgggacttca aaccgtctga cgtcaccagc cgcgcgccaa 25408 

aatgagcaaa gagattccca cgccttacat gtggagttac cagccgcaga tgggattagc 25468 

cgccggcgcc gcccaggatt actccacgaa aatgaactgg ctcagcgccg ggccccacat 25528 

gatttcccgc gtaaacgaca ttcgcgccca ccgcaatcag ctattgttag aacaggctgc 25588 

tctgaccgcc acgccccgta ataacctgaa ccctcccagc tggccagctg ccctggtgta 25648 
ccaggaaacg cctccaccca ccagcgtact tttgccccgt gacgcccagg cggaagtcca 25708 
gatgactaac gcgggcgcgc aattagcggg cggatcccgg tttcggtaca gagttcacgg 25768 
cgccgcaccc tatagcccag gtataaagag gctgatcatt cgaggcagag gtgtccagct 25828 
caacgacgag acagtgagct cttcgcttgg tctacgacca gacggagtgt tccagctcgc 25888 
gggctcgggc cgctcttcgt tcacgcctcg ccaggcatac ctgactctgc agagctctgc 25948 
ctctcagcct cgctcgggag gaatcggacc ccttcagttt gtggaggagt ttgtgccctc 26008 
ggtctacttt cagcctttct ccggatcgcc cggccagtac ccggacgagt tcatccccaa 26068 
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cttcgacgcg gtgagtgact ctgtggacgg ttatgactga tgtcgagccc gcttcagtgc 26128 

tagtggaaca agcgcggctc aatcacctgg ttcgttgccg ccgccgctgc tgcgtggctc 26188 

gcgacttgag cttagctctc aagtttgtaa aaaacccgtc cgaaaccggg agcgctgtgc 26248 

acgggttgga gctagtgggt cctgagaagg ccaccatcca cgttctcaga aactttgtgg 26308 

aaaaacccat tttggttaaa cgagatcagg ggccttttgt aatcagctta ctctgcacct 26368 

gtaaccatgt tgaccttcac gactatttta tggatcattt gtgcgctgaa ttcaataagt 26428 

aaagcgaatt cttaccaaga ttatgatgtc catgactgtt cctcgccact atacgatgtt 26488 

gtgccagtaa actctcttgt cgacatctat ctgaactgtt ccttttggtc cgcacagctt 26548 

acttggtact acggtgacac cgtcctttct ggctcactgg gcagctcaca cggaataaca 26608 

cttcacctct tttcgccgtt tcgatacgga aactacagct gtcgtgccgg tacctgcctc 26668 

cacgttttca atcttcagcc ctgtccaccg accaaacttg tatttgtcga ctctaagcac 26728 

ttacagctca actgcagcat tctaggcccc agtatcttgt ggacatacaa taaaatcagg 26788 

ttggtggaat ttgtctacta cccacccagc gcccgcggtt ttggggaaat tcctttccag 26848 

atctactaca actatcttgc cacacattat gcaagtcaac agcaactaaa cttgcaagca 26908 

cccttcacgc caggagagta ctcctgtcac gtaggctcct gcacagaaac ttttattctc 26968 

ttcaacagat cttctgccat tgaacgcttc actactaact actttagaaa ccaagttgtg 27028 

cttttcactg acgaaacccc taacgtcacc ctggactgtg catgtttttc tcatgacacc 27088 

gtaacttgga ctcttaacaa tactctctgg ctcgcgttcg ataaccaaag cttgattgtt 27148 

aaaaattttg atttaacctt tactaaaccc tctcctcgcg aaatagttat ctttgctcct 27208 

tttaatccaa aaactacctt agcctgtcag gttttgttta agccttgcca aacaaacttt 27268 

aagtttgttt atttgcctcc gcaatctgtc aaactcatag aaaaatacaa caaagcgccc 27328 

gtcttggctc ctaaaacctt ctaccactgg ctaacctaca cggggctgtt tgcactaatt 27388 

gtttttttcc taattaacat ttttatatgt ttcttgcctt cctccttctt ttcgcgaaca 27448 

ccgttgccgc agaaagacct ctccttatta ctgtagcgct tgctatacaa aaccaagagt 27508 

ggtcaaccgt gctctcaatc tattttcaat ttttcatttt gtccttaata ctttctctta 27568 

ttgtcgttaa caatgatctg gagcattggt ctcgcctttt tttggctgct tagtgcaaaa 27628 
gccactattt ttcacaggta tgtggaagaa ggaactagca ccctctttac gatacctgaa 27688 
acaattaagg cggctgatga agtttcttgg tacaaaggct cgctctcaga cggcaaccac 27748 
tcattctcag gacagaccct ttgcatccaa gaaacttatt ttaaatcaga actacaatac 27808 
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agctgcataa aaaacttttt ccatctctac aacatctcaa aaccctatga gggtatttac 27868 

aatgccaagg tttcagacaa ctccagcaca cggaactttt actttaatct gacagttatt 27928 

aaagcaattt ccattcctat ctgtgagttt agctcccagt ttctttctga aacctactgt 27988 

ttaattacta taaactgcac taaaaatcgc cttcacacca ccataatcta caatcacaca 28048 

caatcacctt gggttttaaa cctaaaattt tctccacaca tgccttcgca atttctcacg 28108 

caagttaccg tctctaacat aagcaagcag tttggctttt actatccttt ccacgaactg 28168 

tgcgaaataa ttgaagccga atatgaacca gactacttta cttacattgc cattggtgta 28228 

atcgttgttt gcctttgctt tgttattggg gggtgtgttt atttgtacat tcagagaaaa 28288 

atattgctct cgctgtgctc ctgcggttac aaagcagaag aaagaattaa aatctctaca 28348 

ctttattaat gttttccaga aatggcaaaa ctaacgctcc tacttttgct tctcacgccg 28408 

gtgacgcttt ttaccatcac tttttctgcc gccgccacac tcgaacctca atgtttgcca 28468 

ccggttgaag tctactttgt ctacgtgttg ctgtgctgcg ttagcgtttg cagtataaca 28528 

tgttttacct ttgtttttct tcagtgcatt gactacttct gggtcagact ctactaccgc 28588 

agacacgcgc ctcagtatca aaatcaacaa attgccagac tactcggtct gccatgattg 28648 

tcttgtattt taccctgatt ttttttcacc ttacttgcgc ttgtgatttt cacttcactc 28708 

aattttggaa aacgcaatgc ttcgacccgc gcctctccaa cgactggatg atggctcttg 28768 

caattgccac gcttggggcg tttggacttt ttagtggttt tgctttgcat tacaaattta 28828 

agactccatg gacacatggc tttctttcag attttccagt tacacctact ccgccgcctc 28888 

ccccggccat cgacgtgcct caggttccct caccttctcc atctgtctgc agctactttc 28948 

atctgtaatg gccgacctag aatttgacgg agtgcaatct gagcaaaggg ctatacactt 29008 

ccaacgccag tcggaccgcg aacgcaaaaa cagagagctg caaaccatac aaaacaccca 29068 

ccaatgtaaa cgcgggatat tttgtattgt aaaacaagct aagctccact acgagcttct 29128 

atctggcaac gaccacgagc tccaatacgt ggtcgatcag cagcgtcaaa cctgtgtatt 29188 

cttaattgga gtttccccca ttaaagttac tcaaaccaag ggtgaaacca agggaaccat 29248 

aaggtgctca tgtcacctgt cagaatgcct ttacactcta gttaaaaccc tatgtggctt 29308 

acatgattct atccccttta attaaataaa cttactttaa atctgcaatc acttcttcgt 29368 

ccttgttttt gtcgccatcc agcagcacca ccttcccctc ttcccaactt tcatagcata 29428 

ttttccgaaa agaggcgtac tttcgccaca ccttaaaggg aacgtttact tcgctttcaa 29488 
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gctctcccac gattttcatt gcagat atg aaa cgc gcc aaa 

Met Lys Arg Ala Lys 
1425 



gtg gaa gaa gga 
Val Glu Glu Gly 



29541 



ttt aac ccc gtt tat ccc 
Phe Asn Pro Val Tyr Pro 
1430 " 1435 



tat gga tat tct act ccg act gac gtg 
Tyr Gly Tyr Ser Thr Pro Thr Asp Val 
1440 



29586- 



gct cct ccc ttt gta gcc tct gac ggt ctt caa gaa aac cca cct 
Ala Pro Pro Phe Val Ala Ser Asp Gly Leu Gin Glu Asn Pro Pro 
1445 1450 1455 



29631 



ggg gtc ttg tec eta aaa ata tec aaa cct tta act ttt aat gcc 
Gly Val Leu Ser Leu Lys lie Ser Lys Pro Leu Thr Phe Asn Ala 
1460 1465 1470 



29676 



tec aag get eta age ctg get att ggt cca gga tta aaa att caa 
Ser Lys Ala Leu Ser Leu Ala lie Gly Pro Gly Leu Lys lie Gin 
1475 1480 1485 



29721 



gat ggt aaa eta gtg ggg gag gga caa gca att 
Asp Gly Lys Leu Val Gly Glu Gly Gin Ala lie 
1490 1495 1500 



ctt gca aac ctg 29766 
Leu Ala Asn Leu 



ccg ctt caa ate acc aac aac aca att tea eta cgt ttt ggg aac 
Pro Leu Gin lie Thr Asn Asn Thr lie Ser Leu Arg Phe Gly Asn 
1505 1510 1515 



29811 



aca ctt gcc ttg aat gac aat aat gaa etc caa acc aca eta aaa 
Thr Leu Ala Leu Asn Asp Asn Asn Glu Leu Gin Thr Thr Leu Lys 
1520 1525 1530 



29856 



tct tea teg ccc ctt aaa ate aca gac cag act ctg tec ctt aac 
Ser Ser Ser Pro Leu Lys lie Thr Asp Gin Thr Leu Ser Leu Asn 
1535 1540 * 1545 



29901 



ata ggg gac age ctt gca 
lie Gly Asp Ser Leu Ala 
1550 1555 



att aaa gat gac aaa eta gaa age get 
lie Lys Asp Asp Lys Leu Glu Ser Ala 
1560 



29946 



ctt caa gcg acc etc cca etc tec att age aac aac acc ate age 
Leu Gin Ala Thr Leu Pro Leu Ser lie Ser Asn Asn Thr lie Ser 
1565 1570 1575 



29991 



etc aac gtg ggc acc gga etc acc ata aat gga aac gtt tta caa 
Leu Asn Val Gly Thr Gly Leu Thr lie Asn Gly Asn Val Leu Gin 
1580 1585 1590 



30036 



get gtt ccc tta aat get eta agt ccc eta act 
Ala Val Pro Leu Asn Ala Leu Ser Pro Leu Thr 
1595 1600 1605 



att tec aac aat 
lie Ser Asn Asn 



30081 



aac ate age ctg cgc tat ggc agt tec ctg acg gtg ctt aac aat 
Asn lie Ser Leu Arg Tyr Gly Ser Ser Leu Thr Val Leu Asn Asn 
1610 1615 1620 



30126 
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gaa ctg caa age aac etc aca gtt cac tec cct tta aaa etc aac 
Glu Leu Gin Ser Asn Leu Thr Val His Ser Pro Leu Lys Leu Asn 
1625 1630 1635 



30171 



tec aac aac tea att tct 
Ser Asn Asn Ser lie Ser 
1640 1645 



etc aac act eta tct ccg ttt aga ate 
Leu Asn Thr Leu Ser Pro Phe Arg lie 
1650 



30216 



gag aat ggt ttc etc acg etc tat ttg gga aca aaa tct ggc ttg 
Glu Asn Gly Phe Leu Thr Leu Tyr Leu Gly Thr Lys Ser Gly Leu 
1655 1660 ~ 1665 



30261 



eta gtt caa aac agt ggc tta aaa gtt caa gcg ggc tac ggc ctg 
Leu Val Gin Asn Ser Gly Leu Lys Val Gin Ala Gly Tyr Gly Leu 
1670 1675 " 1680 



30306 



caa gta aca gac ace aat get etc aca tta aga tat etc get cca 
Gin Val Thr Asp Thr Asn Ala Leu Thr Leu Arg Tyr Leu Ala Pro 
1685 1690 1695 



30351 



ctg acc att cca gac teg ggc tea gaa caa ggc att ctt aaa gta 
Leu Thr lie Pro Asp Ser Gly Ser Glu Gin Gly lie Leu Lys Val 
1700 1705 1710 



30396 



aac act gga cag ggc eta agt gtg aac caa get gga gcg ctt gaa 
Asn Thr Gly Gin Gly Leu Ser Val Asn Gin Ala Gly Ala Leu Glu 
1715 1720 1725 



30441 



aca tec eta gga ggt gga 
Thr Ser Leu Gly Gly Gly 
1730 1735 



tta aaa tat get gat aac aaa ata acc 
Leu Lys Tyr Ala Asp Asn Lys lie Thr 
1740 



30486 



ttt gat aca gga aac gga ctg aca tta tct gaa aat aaa ctt gca 
Phe Asp Thr Gly Asn Gly Leu Thr Leu Ser Glu Asn Lys Leu Ala 
1745 1750 1755 



30531 



gta get gca ggt agt ggt eta act ttt aga gat ggt gec ttg gta 
Val Ala Ala Gly Ser Gly Leu Thr Phe Arg Asp Gly Ala Leu Val 
1760 1765 1770 



30576 



gee acg gga acc gca ttt acg caa aca ctg tgg act acg get gat 
Ala Thr Gly Thr Ala Phe Thr Gin Thr Leu Trp Thr Thr Ala Asp 
1775 1780 1785 



30621 



ccg tct ccc aac tgc aca att ata cag gac cgc gac aca aaa ttt 
Pro Ser Pro Asn Cys Thr lie lie Gin Asp Arg Asp Thr Lys Phe 
1790 1795 1800 



30666 



act ttg gcg ctt acc att agt ggg age caa gtg 
Thr Leu Ala Leu Thr lie Ser Gly Ser Gin Val 
1805 1810 1815 



ctg ggg acg gtt 
Leu Gly Thr Val 



30711 



tec att att gga gta aaa ggc ccc ctt tea agt 
Ser lie He Gly Val Lys Gly Pro Leu Ser Ser 
1820 1825 1830 



age ata ccg tea 
Ser He Pro Ser 



30756 
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get acc gtt aca gta caa ctt aac ttt gat tec aac gga gec eta 30801 

Ala Thr Val Thr Val Gin Leu Asn Phe Asp Ser Asn Gly Ala Leu 
1835 1840 * 1845 

ttg age tec tct tea ctt aaa ggt tac tgg ggg tat cgc caa ggt 30846 

Leu Ser Ser Ser Ser Leu Lys Gly Tyr Trp Gly Tyr Arg Gin Gly 
1850 1855 1860 

ccc tea att gac cct tac ccc ata att aat gee tta aac ttt atg 30891 

Pro Ser lie Asp Pro Tyr Pro lie lie Asn Ala Leu Asn Phe Met 
1865 1870 1875 

cca aac tea ctg get tat ccc ccg gga caa gaa ate caa gca aaa 30936 

Pro Asn Ser Leu Ala Tyr Pro Pro Gly Gin Glu lie Gin Ala Lys 
1880 1885 1890 

tgt aac atg tac gtt tct act ttt tta cga gga aat cca caa aga 30981 

Cys Asn Met Tyr Val Ser Thr Phe Leu Arg Gly Asn Pro Gin Arg 
1895 1900 1905 

cca ata gtt tta aac ate act ttt aat aat caa acc age ggg ttt 31026 

Pro lie Val Leu Asn lie Thr Phe Asn Asn Gin Thr Ser Gly Phe 
1910 1915 1920 

tec att aga ttt aca tgg aca aat tta acc aca gga gaa gca ttt 31071 

Ser lie Arg Phe Thr Trp Thr Asn Leu Thr Thr Gly Glu Ala Phe 
1925 1930 1935 



gca atg ccc cca tgc act ttt tec tac att get gaa caa caa taa 31116 
Ala Met Pro Pro Cys Thr Phe Ser Tyr lie Ala Glu Gin Gin 
1940 1945 1950 



actatgtaac 


cctcaccgtt 


aacccgcctc 


cgcccttcca 


ttttatttta 


taaaccaccc 


31176 


gatccacctt 


ttcagcagta 


aacaattgea 


tgtcagtagg 


ggcagtaaaa 


cttttgggag 


31236 


ttaaaatcca 


cacaggttct 


tcacaagcta 


agegaaaate 


agttacactt 


ataaaaccat 


31296 


cgctaacatc 


ggacaaagac 


aagcatgagt 


ccaaagcttc 


cggttctgga 


tcagattttt 


31356 


gttcattaac 


agegggagaa 


acagcttctg 


gaggattttc 


catctccatc 


tccttcatca 


31416 


gttccaccat 


gtccaccgtg 


gtcatctggg 


acgagaacga 


cagttgtcat 


acacctcata 


31476 


agtcaccggt 


cgatgacgaa 


egtacagate 


tcgaagaatg 


tcctgtcgcc 


gcctttcggc 


31536 


agcactgggc 


egaaggegaa 


agcgcccatg 


tttaacaatg 


gccagcaccg 


cccgcttcat 


31596 


caggcgccta 


gttcttttag 


cgcaacagcg 


catgcgcagc 


tegctaagae 


tggegcaaga 


31656 


aacacagcac 


agaaccacca 


gattgttcat 


gatcccataa 


gcgtgctgac 


accagcccat 


31716 


actaacaaat 


tgtttcacta 


ttctagcatg 


aatgtcatat 


ctgatgttca 


agtaaattaa 


31776 


atggcgcccc 


cttatgtaaa 


cacttcccac 


gtacaacacc 


tcctttggca 


tctgataatt 


31836 


aaccacctcc 


cgataccaaa 


tacatctctg 


attaatagtc 


gccccgtaca 


ctacccgatt 


31896 
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aaaccaagtt 


gccaacataa 


tcccccctgc 


catacactgc 


aaagaacctg 


gacggctaca 


31956 


atgacagtgc 


aaagtccaca 


cctcgttgcc 


atggataact 


gaggaacgcc 


ttaagtcaat 


32016 


agtggcacaa 


ctaatacaaa 


catgtaaata 


gtgtttcaac 


aagtgccact 


cgtatgaggt 


32076 


gagtatcatg 


tcccagggaa 


cgggccactc 


cataaacact 


gcaaaaccaa 


cacatcctac 


32136 


catcccccgc 


acggcactca 


catcgtgcat 


ggtgttcata 


tcacagtccg 


gaagctgagg 


32196 


acaaggaaaa 


gtctcgggag 


cattttcata 


gggcggtagt 


gggtactcct 


tgtaggggtt 


32256 


cagtcggcac 


cggtatctcc 


tcaccttctg 


ggccataaca 


cacaagttga 


gatctgattt 


32316 


caaggtactt 


tctgaatgaa 


aaccaagtgc 


tttcccaaca 


atgtatccga 


tgtcttcggt 


32376 


ccccgcgtcg 


gtagcgctcc 


ttgcagtaca 


cacggaacaa 


ccactcacgc 


aggcccagaa 


32436 


gacagttttc 


cgcggacggt 


gacaagttaa 


tccccctcag 


tctcagagcc 


aatatagttt 


32496 


cttccacagt 


agcataggcc 


aaacccaacc 


aggaaacaca 


agctggcacg 


tcccgttcaa 


32556 


cgggaggaca 


aggaagcaga 


ggcagaggca 


taggcaaagc 


aacagaattt 


ttattccaac 


32616 


tggtcacgta 


gcacttcaaa 


caccaggtca 


cgtaaatggc 


agcgatcttg 


ggtttcctga 


32676 


tggaacataa 


cagcaagatc 


aaacatgaga 


cgattctcaa 


ggtgattaac 


cacagctgga 


32736 


attaaatcct 


ccacgcgcac 


atttagaaac 


accagcaata 


caaaagcccg 


gttttctccg 


32796 


ggatctatca 


tagcagcaca 


gtcatcaatt 


agtcccaagt 


aattttcccg 


tttccaatct 


32856 


gttataattt 


gcagaataat 


gccctgtaaa 


tccaagccgg 


ccatggcgaa 


aagctcagat 


32916 


aatgcacttt 


ccacgtgcat 


tcgtaaacac 


accctcatct 


tgtcaatcca 


aaaagtcttc 


32976 


ttcttgagaa 


acctgtagta 


aattaagaat 


cgccaggtta 


ggctcgatgc 


ctacatcccg 


33036 


gagcttcatt 


ctcagcatgc 


actgcaaatg 


atccagcaga 


tcagaacagc 


aattagcagc 


33096 


cagctcatcc 


ccggtttcca 


gttccggagt 


tcccacggca 


attatcactc 


gaaacgtggg 


33156 


acaaatcgaa 


ataacatgag 


ctcccacgtg 


agcaaaagcc 


gtagggccag 


tgcaataatc 


33216 


acagaaccag 


cggaaaaaag 


attgcagctc 


atgtttcaaa 


aagctctgca 


gatcaaaatt 


33276 


cagctcatgc 


aaataacaca 


gtaaagtttg 


cggtatagta 


accgaaaacc 


acacgggtcg 


33336 


acgttcaaac 


atctcggctt 


acctaaaaaa 


gaagcacatt 


tttaaaccac 


agtcgcttcc 


33396 


tgaacaggag 


gaaatatggt 


gcggcgtaaa 


accagacgcg 


ccaccggatc 


tccggcagag 


33456 


ccctgataat 


acagccagct 


gtggttaaac 


agcaaaacct 


ttaattcggc 


aacggttgag 


33516 


gtctccacat 


aatcagcgcc 


cacaaaaatc 


ccatctcgaa 


cttgctcgcg 


tagggagcta 


33576 
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aaatggccag tatagcccca tggcacccga acgctaatct gcaagtatat gagagccacc 33636 

ccattcggcg ggatcacaaa atcagtcgga gaaaacaacg tatacacccc ggactgcaaa 33696 

agctgttcag gcaaacgccc ctgcggtccc tctcggtaca ccagcaaagc ctcgggtaaa 33756 

gcagccatgc caagcgctta ccgtgccaag agcgactcag acgaaaaagt gtactgaggc 33816 

gctcagagca gcggctatat actctacctg tgacgtcaag aaccgaaagt caaaagttca 33876 

cccggcgcgc ccgaaaaaac ccgcgaaaat ccacccaaaa agcccgcgaa aaacacttcc 33936 

gtataaaatt tccgggttac cggcgcgtca ccgccgcgcg acacgcccgc cccgccccgc 33996 

gctcctcccc gaaacccgcc gcgcccactt ccgcgttccc aagacaaagg tcgcgtaact 34056 

ccgcccacct catttgcatg ttaactcggt cgccatcttg cggtgttata ttgatgatg 34115 



<210> 35 
<211> 503 
<212> PRT 

<213> simian adenovirus SV-39 
<400> 35 

Met Arg Arg Ala Val Ala Val Pro Ser Ala Ala Met Ala Leu Gly Pro 
15 10 15 

Pro Pro Ser Tyr Glu Ser Val Met Ala Ala Ala Thr Leu Gin Ala Pro 
20 25 30 



Leu Glu Asn Pro Tyr Val Pro Pro Arg Tyr Leu Glu Pro Thr Gly Gly 
35 40 45 

Arg Asn Ser lie Arg Tyr Ser Glu Leu Thr Pro Leu Tyr Asp Thr Thr 
50 ^ 55 60 

Arg Leu Tyr Leu Val Asp Asn Lys Ser Ala Asp He Ala Thr Leu Asn 
65 70 75 80 

Tyr Gin Asn Asp His Ser Asn Phe Leu Thr Ser Val Val Gin Asn Ser 
85 90 95 

Asp Tyr Thr Pro Ala Glu Ala Ser Thr Gin Thr He Asn Leu Asp Asp 
100 105 HO 

Arg Ser Arg Trp Gly Gly Asp Leu Lys Thr He Leu His Thr Asn Met 
115 120 125 

Pro Asn Val Asn Glu Phe Met Phe Thr Asn Ser Phe Arg Ala Lys Leu 
130 135 140 

Met Val Ala His Glu Ala Asp Lys Asp Pro Val Tyr Glu Trp Val Gin 
145 150 155 160 
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Leu Thr Leu Pro Glu Gly Asn Phe Ser Glu lie Met Thr lie Asp Leu 
165 170 175 

Met Asn Asn Ala lie lie Asp His Tyr Leu Ala Val Ala Arg Gin Gin 
180 185 190 

Gly Val Lys Glu Ser Glu He Gly Val Lys Phe Asp Thr Arg Asn Phe 
195 200 205 

Arg Leu Gly Trp Asp Pro Glu Thr Gly Leu Val Met Pro Gly Val Tyr 
210 215 220 

Thr Asn Glu Ala Phe His Pro Asp Val Val Leu Leu Pro Gly Cys Gly 
225 230 235 240 

Val Asp Phe Thr Tyr Ser Arg Leu Asn Asn Leu Leu Gly He Arg Lys 
245 250 255 

Arg Met Pro Phe Gin Glu Gly Phe Gin He Leu Tyr Glu Asp Leu Glu 
260 265 270 

Gly Gly Asn He Pro Ala Leu Leu Asp Val Pro Ala Tyr Glu Glu Ser 
275 280 " 285 

He Ala Asn Ala Arg Glu Ala Ala He Arg Gly Asp Asn Phe Ala Ala 
290 295 300 

Gin Pro Gin Ala Ala Pro Thr He Lys Pro Val Leu Glu Asp Ser Lys 
305 310 315 320 

Gly Arg Ser Tyr Asn Val He Ala Asn Thr Asn Asn Thr Ala Tyr Arg 
325 330 335 

Ser Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg 
340 345 350 

Ala Trp Thr Leu Leu Thr Thr Pro Asp Val Thr Cys Gly Ser Glu Gin 
355 360 365 

Val Tyr Trp Ser Leu Pro Asp Met Tyr Val Asp Pro Val Thr Phe Arg 
370 375 380 

Ser Thr Gin Gin Val Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Met 
385 390 395 400 

Pro He His Ser Lys Ser Phe Tyr Asn Glu Gin Ala Val Tyr Ser Gin 
405 410 415 

Leu He Arg Gin Thr Thr Ala Leu Thr His Val Phe Asn Arg Phe Pro 
420 425 430 

Glu Asn Gin He Leu Val Arg Pro Pro Ala Pro Thr He Thr Thr Val 
435 440 445 



Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Gin 
450 455 460 
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Asn Ser lie Arg Gly Val Gin Arg Val Thr lie Thr Asp Ala Arg Arg 
465 470 475 480 

Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly He Val Ala Pro Arg 
485 490 495 

Val Leu Ser Ser Arg Thr Phe 
500 



<210> 36 
<211> 917 
<212> PRT 

<213> simian adenovirus SV-39 
<400> 36 

Met Ala Thr Pro Ser Met Met Pro Gin Trp Ser Tyr Met His He Ala 
15 10 15 

Gly Gin Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gin Phe Ala 
20 25 30 

Arg Ala Thr Glu Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro 
35 40 45 

Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gin Arg Leu 
50 55 60 

Thr He Arg Phe Val Pro Val Asp Lys Glu Asp Thr Ala Tyr Ser Tyr 
65 70 75 80 

Lys Thr Arg Phe Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met 
85 90 95 

Ala Ser Thr Tyr Phe Asp He Arg Gly Val He Asp Arg Gly Pro Ser 
100 105 110 

Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly 
115 120 125 

Ala Pro Asn Asn Ser Gin Trp Asn Ala Thr Asp Asn Gly Asn Lys Pro 
130 135 140 

Val Cys Phe Ala Gin Ala Ala Phe He Gly Gin Ser He Thr Lys Asp 
145 150 155 160 

Gly Val Gin He Gin Asn Ser Glu Asn Gin Gin Ala Ala Ala Asp Lys 
165 170 175 

Thr Tyr Gin Pro Glu Pro Gin He Gly Val Ser Thr Trp Asp Thr Asn 
180 185 190 

Val Thr Ser Asn Ala Ala Gly Arg Val Leu Lys Ala Thr Thr Pro Met 
195 200 205 
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Leu Pro Cys Tyr Gly Ser Tyr Ala Asn Pro Thr Asn Pro Asn Gly Gly 
210 ~ 215 220 

Gin Ala Lys Thr Glu Gly Asp lie Ser Leu Asn Phe Phe Thr Thr Thr 
225 230 235 240 

Ala Ala Ala Asp Asn Asn Pro Lys Val Val Leu Tyr Ser Glu Asp Val 
245 250 255 

Asn Leu Gin Ala Pro Asp Thr His Leu Val Tyr Lys Pro Thr Val Gly 
260 265 270 

Glu Asn Val lie Ala Ala Glu Ala Leu Leu Thr Gin Gin Ala Cys Pro 
275 280 285 

Asn Arg Ala Asn Tyr lie Gly Phe Arg Asp Asn Phe lie Gly Leu Met 
290 295 300 

Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gin Ala Ser 
305 310 315 320 

Gin Leu Asn Ala Val Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser 
325 330 335 

Tyr Gin Leu Met Leu Asp Ala Leu Gly Asp Arg Thr Arg Tyr Phe Ser 
340 345 350 

Met Trp Asn Gin Ala Val Asp Ser Tyr Asp Pro Asp Val Arg lie lie 
355 360 365 

Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu 
370 375 380 

Pro Gly Met Gly lie Phe Asn Ser Tyr Lys Gly Val Lys Pro Gin Asn 
385 ~ 390 395 400 

Gly Gly Asn Gly Asn Trp Glu Ala Asn Gly Asp Leu Ser Asn Ala Asn 
405 410 415 

Glu He Ala Leu Gly Asn He Phe Ala Met Glu He Asn Leu His Ala 
420 425 430 

Asn Leu Trp Arg Ser Phe Leu Tyr Ser Asn Val Ala Leu Tyr Leu Pro 
435 440 445 

Asp Ser Tyr Lys Phe Thr Pro Ala Asn He Thr Leu Pro Ala Asn Gin 
450 ~ 455 460 

Asn Thr Tyr Glu Tyr He Asn Gly Arg Val Thr Ser Pro Thr Leu Val 
465 " 470 475 480 

Asp Thr Phe Val Asn lie Gly Ala Arg Trp Ser Pro Asp Pro Met Asp 
485 490 495 

Asn Val Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg 
500 505 510 
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Ser Met Leu Leu Gly Asn Gly Arg Val Val Pro Phe His He Gin Val 
515 520 525 

Pro Gin Lys Phe Phe Ala He Lys Asn Leu Leu Leu Leu Pro Gly Ser 
530 535 540 

Tyr Thr Tyr Glu Trp Ser Phe Arg Lys Asp Val Asn Met He Leu Gin 
545 * 550 555 560 

Ser Thr Leu Gly Asn Asp Leu Arg Val Asp Gly Ala Ser Val Arg He 
565 " 570 575 

Asp Ser Val Asn Leu Tyr Ala Asn Phe Phe Pro Met Ala His Asn Thr 
580 585 590 

Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gin Ser 
595 600 605 

Phe Asn Asp Tyr Leu Ser Ala Ala Asn Met Leu Tyr Pro lie Pro Ala 
610 * 615 620 

Asn Ala Thr Asn Val Pro He Ser He Pro Ser Arg Asn Trp Ala Ala 
625 630 635 640 

Phe Arg Gly Trp Ser Phe Thr Arg Leu Lys Ala Lys Glu Thr Pro Ser 
645 650 655 

Leu Gly Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly Thr He Pro 
660 665 670 

Tyr Leu Asp Gly Ser Phe Tyr Leu Asn His Thr Phe Lys Arg Leu Ser 
675 680 685 

He Met Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu 
690 695 700 

Thr Pro Asn Glu Phe Glu He Lys Arg He Val Asp Gly Glu Gly Tyr 
705 710 715 720 

Asn Val Ala Gin Ser Asn Met Thr Lys Asp Trp Phe Leu He Gin Met 
725 730 735 

Leu Ser His Tyr Asn He Gly Tyr Gin Gly Phe Tyr Val Pro Glu Gly 
740 745 750 

Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gin Pro Met Ser 
755 760 765 



Arg Gin Val Pro Asp Pro Thr Ala Ala Gly Tyr Gin Ala Val Pro Leu 
770 " 775 780 

Pro Arg Gin His Asn Asn Ser Gly Phe Val Gly Tyr Met Gly Pro Thr 
785 790 795 800 

Met Arg Glu Gly Gin Pro Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu He 
805 810 815 
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Gly Ala Thr Ala Val Pro Ala lie Thr Gin Lys Lys Phe Leu Cys Asp 
820 825 830 

1 

Arg Val Met Trp Arg lie Pro Phe Ser Ser Asn Phe Met Ser Met Gly 
835 " 840 845 

Ala Leu Thr Asp Leu Gly Gin Asn Met Leu Tyr Ala Asn Ser Ala His 
850 855 860 

Ala Leu Asp Met Thr Phe Glu Val Asp Pro Met Asn Glu Pro Thr Leu 
865 870 875 880 

Leu Tyr Met Leu Phe Glu Val Phe Asp Val Val Arg Val His Gin Pro 
885 890 895 

His Arg Gly He He Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala 
900 905 910 

Gly Asn Ala Thr Thr 
915 



<210> 37 
<211> 533 
<212> PRT 

<213> simian adenovirus SV-39 
<400> 37 

Met Lys Arg Ala Lys Val Glu Glu Gly Phe Asn Pro Val Tyr Pro Tyr 
15 10 15 

Gly Tyr Ser Thr Pro Thr Asp Val Ala Pro Pro Phe Val Ala Ser Asp 
20 " 25 30 

Gly Leu Gin Glu Asn Pro Pro Gly Val Leu Ser Leu Lys He Ser Lys 
35 40 45 

Pro Leu Thr Phe Asn Ala Ser Lys Ala Leu Ser Leu Ala He Gly Pro 
50 55 60 

Gly Leu Lys He Gin Asp Gly Lys Leu Val Gly Glu Gly Gin Ala He 
65 70 75 80 

Leu Ala Asn Leu Pro Leu Gin He Thr Asn Asn Thr He Ser Leu Arg 
85 90 95 

Phe Gly Asn Thr Leu Ala Leu Asn Asp Asn Asn Glu Leu Gin Thr Thr 
100 105 110 

Leu Lys Ser Ser Ser Pro Leu Lys He Thr Asp Gin Thr Leu Ser Leu 
115 120 125 

Asn He Gly Asp Ser Leu Ala He Lys Asp Asp Lys Leu Glu Ser Ala 
130 " " 135 140 
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Leu Gin Ala Thr Leu Pro Leu Ser lie Ser Asn Asn Thr lie Ser Leu 
145 150 155 160 

Asn Val Gly Thr Gly Leu Thr He Asn Gly Asn Val Leu Gin Ala Val 
165 170 175 

Pro Leu Asn Ala Leu Ser Pro Leu Thr He Ser Asn Asn Asn He Ser 
180 185 190 

Leu Arg Tyr Gly Ser Ser Leu Thr Val Leu Asn Asn Glu Leu Gin Ser 
195 200 205 

Asn Leu Thr Val His Ser Pro Leu Lys Leu Asn Ser Asn Asn Ser He 
210 215 220 

Ser Leu Asn Thr Leu Ser Pro Phe Arg He Glu Asn Gly Phe Leu Thr 
225 230 235 " 240 

Leu Tyr Leu Gly Thr Lys Ser Gly Leu Leu Val Gin Asn Ser Gly Leu 
245 250 255 

Lys Val Gin Ala Gly Tyr Gly Leu Gin Val Thr Asp Thr Asn Ala Leu 
260 265 270 

Thr Leu Arg Tyr Leu Ala Pro Leu Thr He Pro Asp Ser Gly Ser Glu 
275 280 285 

Gin Gly He Leu Lys Val Asn Thr Gly Gin Gly Leu Ser Val Asn Gin 
290 295 300 

Ala Gly Ala Leu Glu Thr Ser Leu Gly Gly Gly Leu Lys Tyr Ala Asp 
305 310 315 "* 320 

Asn Lys He Thr Phe Asp Thr Gly Asn Gly Leu Thr Leu Ser Glu Asn 
325 330 335 

Lys Leu Ala Val Ala Ala Gly Ser Gly Leu Thr Phe Arg Asp Gly Ala 
340 345 ' 350 

Leu Val Ala Thr Gly Thr Ala Phe Thr Gin Thr Leu Trp Thr Thr Ala 
355 360 365 

Asp Pro Ser Pro Asn Cys Thr He He Gin Asp Arg Asp Thr Lys Phe 
370 375 380 

Thr Leu Ala Leu Thr He Ser Gly Ser Gin Val Leu Gly Thr Val Ser 
385 390 395 400 

He He Gly Val Lys Gly Pro Leu Ser Ser Ser He Pro Ser Ala Thr 
405 410 415 

Val Thr Val Gin Leu Asn Phe Asp Ser Asn Gly Ala Leu Leu Ser Ser 
420 425 430 



Ser Ser Leu Lys Gly Tyr Trp Gly Tyr Arg Gin Gly Pro Ser He Asp 
435 440 445 
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Pro Tyr Pro lie lie Asn Ala Leu Asn Phe Met Pro Asn Ser Leu Ala 
450 455 460 

Tyr Pro Pro Gly Gin Glu lie Gin Ala Lys Cys Asn Met Tyr Val Ser 
465 470 * 475 480 

Thr Phe Leu Arg Gly Asn Pro Gin Arg Pro lie Val Leu Asn He Thr 
485 490 495 

Phe Asn Asn Gin Thr Ser Gly Phe Ser He Arg Phe Thr Trp Thr Asn 
500 505 510 

Leu Thr Thr Gly Glu Ala Phe Ala Met Pro Pro Cys Thr Phe Ser Tyr 
515 520 525 

He Ala Glu Gin Gin 
530 



<210> 


38 


<211> 


50 


<212> 


DNA 


<213> 


Artificial sequence 


<220> 




<223> 


oligomer SV25T 


<400> 


38 



aatttaaata cgtagcgcac tagtcgcgct aagcgcggat atcatttaaa 50 



<210> 


39 


<211> 


49 


<212> 


DNA 


<213> 


Artificial sequence 


<220> 




<223> 


oligomer SV25B 


<400> 


39 



tatttaaatg atatccgcgc ttaagcgcga ctagtgcgct acgtattta 49 
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